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ABSTRACT 

Two longitudinal studies of expert- level cognitive 
skills and their acquisition are reported. The first study focused on 
the skill of an otherwise normal subject who increased his digit-span 
to over 100 digits with 4.5 years of practice using the mnemonic 
system developed by V. G. cr.^se and K. A. Ericsson. The second study 
followed two college undergraduates who became mental calculation 
experts by practicing for 3 to 4 years with strategies used by 
previously studied experts. The studies relied on the tenets of 
skilled memory theory. Observations of subjects' learning and 
performance suggest that acquired knowledge , rather than exceptional 
native talents or general aptitudes , constitutes the foundation of 
expertise, and that the knowledge structures and processing that 
support expertise can be analyzed. Theoretical analyses reveal that 
rapid and reliable storage and retrieval of information in lung-term 
memory (LTM) appears to be a key characteristic of experts* 
informcition processing. Acquired memory skills enable experts ♦•o use 
LTM in order to r*- ^uce the constraints that a limited working lu .nory 
capacity imposes on performance in complex tasks. The studies also 
document the effects of efficient information-processing strategies 
on cognitive skill acquisition and expert performance. The report 
distribution list is appendeo. A 5-item summary of publications, a 
92-item list of references, 11 figures, and 9 tables are included. 
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L Introduction 

This is the final import of a project invcstigatmg the naiurc and (Jevelopmcnt of oipcrt-lcvel 
cognitrvi^ skills with tiic mppon of ONR C<mtract N00014-85-K-0524. It summarizes the 
findings of two Icmgitudinal studies of cognitive skill acquisition, m which the skills of the 
participating individuals were subjected to detailed cognitive analyses. 

The accomplishments of this project fit into three broad categories. First, it has produced 
two rich databases on human expertise dut trace the transition firom the novice level to some of 
the iii^iest levels of skilled perfonnance systematically observed. Second, die analyses of the 
experts* performance have added depth and breadth to our understanding of die knowledge and 
memory dynamics siqyporting excq>tional levels of hanan performance. Hkird, it has 
<temonstrated the dieoretical and practical value of die general approach to Imowledge 
engineering** employed by this project 

This repon first reviews the general issues motivating this research project and its 
objectives. Subsequent secti<ms summarize its main empirical, dieoretical, and methodological 
contributions. The final section discusses some practical implications of this work. 

2. Issues and Objectives: Expertise, Knowledge, and Skilled Memory 

The f mdies described here were motivated by several related issues. The objectives of tSiis 
project are reviewed in die context of these issues. 

The most general goal was to refine our knowledge of what enables experts to perform 
demanding cognitive tasks with such extraordinary proficiency that most pecfde believe they 
possess tatents not found in the "normal** pqnilation. From an information-processing 
per^)ective, understanding such excq)tional performance involves asking questicms such as: 
How can we characterize experts* mformaticm-processirg capabilities? What qualitative and/cH* 
quantiUiHve differences account for the tyirically large disparity between the performance of 
expens and novices? The general phrasing of these questions assumes that certain invariants 
characterize expertise and its cognitive substrate across the vanety cf tasks or domains in which it 
is demonstrated. Ccmsidering the fundamedtally adaptive nature of human expertise, however, it 
is important to question this assumption and ask if expertise has any general, context-fiee 
characteristics. If so, what mediods might be most suited to detect them? This project addressed 
these issues by analyzing the cognitive structures and processes, particulariy those related to 



memory, that support expert*level cognitive skills and their development in two domains, 
mnemonic skill and mental arithmetic. 

The second general goal was to analyze and describe the knowledge that siqyports expert 
level performance in the skills studied Three distinct liims of prior research direct investigation 
to expert knowledge, its representation, its use, and its acquisition* First, tl^re is the skilled 
memory effect, the superior memory experts typically exhibit for material found in their realm of 
expertise. The very restricted nature of tteir memory advantage relative to novice controls links 
this effect to Oieir pior experience in a particular domain. The interpretation &at a hig^y- 
organized, domain*specific body of acquired knowledge accounts for Ms effect has received 
substantial empirical support (cf. Chase, 1986; Simon, 1979; Tech. Kept. 89-2). 

Second, the idea that knowledge is the foundation of expertise has been supported by the 
capabilities of computer programs known as expert systems. Essmtialty, these programs codify 
and use knowledge extracted from human experts in domains such as medical diagnosis, chemical 
analysis, and computer system configuration to perform problem*solving and decision-making 
tasks at levels at or near those of human experts (Duda & Shortliffe, 1983). Although these 
systems may not represent, store, and operate upon this knowledge in ways duit human experts 
do, their capabilities show that the knowledge at their core is sufficient to produce expert 
performance (Feigenbaum, 1989). Third, Chase & Ericsson*s (1981, 1982; Ericsson, C se, & 
Falocm, 1980) longitudinal study of the acquisition of mnononic expertise directly linked 
acquired knowledge to unquestionably excq>tional perfonnacce. Collectively, studies of the 
skilled memory effert expert systems, and the acquisition of cognitive skill converge upon the 
conchision that knowledge, rather dian general aptitudes or talents is the foundation of expertise. 

This conchision leads to a theoretical problem and another objective: to identify and 
describe memory structures and processes that mediate experts* application of knowledge. This 
problem is called the ''paradox of expertise'' (Barsalou & Bower, 1984; Smith, Adams, & Schorr, 
1978; Pomer, 1988). The issue is to reconcile experts' effective and efficient use of a large 
knowledge base with die weO-documented Innitations that components of the human mem(^ 
system impose upon information processing capacity (Miller, 1956; Shiffrin, 1976; Simon, 1976). 
This issue lies at die heart of Sunon & Chase's dieory of esqpert skill (Cliase. 1986; Chase & 
Simcm, 1973a, 1973b; Simon & (3iase, 1973; Simon & Gihnartin, 1973; Simon, 1979) and its 
offspring, the firework of dieoretical principles that consititute skilled memory Jieory (SMT) 
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(Cbasc & Ericsson, 1982; Technical Report 89-1). Sharing several key assumptions of its 
precursor, SMT seeks to explain aspects of vtxpert performance related to memory dynamics for 
which the Chase-Simon theory could not accoimt. Key objectives of this project were to evaluate 
SMT's explanation of the paradox of expertise, particulariy its generality, and becter understand 
the cognitive structures and {mxresses that implement skilled memr 7 in specific expert skills. 

2 J* Principles of Skilled Memory 

SNfT postulates that expert-level performance dq)ends upon expends' efficient use of a vast, 
domain-specific knowledge base. This implies that an expert's knowledge base contains more 
than just ccmtent knowledge; becoming an expert also involves developing memory management 
sidlls. SMT asserts that through extensive practice in a particular domain, experts acqut^e 
knowledge structures and procedures for efficiently encoding and retrieving task-relevant 
infcnmation in long4erm memory (LTM). Three general principles descrilie how experts use 
memory efficiently to excel in their partioilar domains. 

The Mnemonic Storage Principle states that experts use abstract, semantic monoiy 
structures developed through extensive experience to quickly recognize and encode familiar 
patterns of Mormation and maintam that information for later use. This i»±x:iple essentially 
states that chunking is a mechanism exp^ use to process large amounts of information in a 
limited capacity STM. Studies of expert mnemonists (Oiase & Ericsson, 1981. 1982; Eticsson, 
Chase, & Faloon, 1980) and experts from domams such as chess (Oiase & Simon, 1973a, 1973b), 
trridge (Chamess, 1979;, the games of go (Reitman, 1976) and gomuku (Eisenstadt & Kareev, 
1975), electronics (Egan & Schwartz, 1979), architecture (Akin, 1982), and ccmiputer 
programming (McKeithen, Reitman, Rueter, & Hirtle, 1981) svppoii this generalization. All 
suggest that experts use knowledge acquired through years of practice in a particular domain and 
stored as organized units in LTM to encode large anounts of information economically. 

The Retrieval Structure Principle states that experts use their acquired understanding of 
the material and tasks of a particular domain to create mechanisms for indexing chunks of 
infonnaticm m LTM in a way that facilitates thek orderly recovery. Retrieval structures are 
memory mechanisms that govern organized storage and retrieval of content informaticHi by 
'^addressing'* information at the time of storage to provide a systematic means of later retrieving 
it Their function is analogous to that served by catalogue systems used in libraries or filing 
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systems used in offices. Retrieval structures rein^esent SMT's solution to tht problems of 
explaining the large number of chunks (sometimes in excess of the storage c^^acity of STM) that 
experts can quickly store and easily retrieve in a prescribed order. 

Finally* the Speed-Up Principle states that the speed and reliabiliQr of both memory 
encoding (i.e.» chunking and LTM indexing) and retrieval processes increases with practice. 
Assuming diat LTM storage and retrieval times decrease continuously with practice (Pirolli & 
Andemm, 198S)» this principle implies that with sufficient practice experts can store and access 
virtually unlimited amounts of infonnation m LTM with the speed and reliability normally 
associated wiA STM storage and retrieval. 

How does the development of highly efficirat LTM encoding and retrieval processes relate 
to drilled performance? Theoretically* efficient and reliable storage and retrieval processes 
enable experts to circumvent basic information-processing limitations, particularly limited STM 
capacity and relatively slow LTM encoding processes (Sim(Xi, 1976). that severely constrain 
novice performance on most complex tasks. In effect, the development of skilled memory 
enables experts to increase their working memory capacity for familiar materials. Increasing the 
amount of infonnation available for processing and the speed at which it can be accessed should 
increase the overall processing capacity of a system per unit of rime. To the extent that complex 
cognitive skiUs require die ap[m>priate sequencing of elementary cognitive o^>erations. skilled 
memory theory, through its postulated retrieval structure mechanism, suggests a means by which 
ef^cient ^ocedural control is achieved. The enhanced processing capacity predicted by SMT is 
consistent widi the speed and accuracy that typifies expertise in complex cognitive skills. 

3. The Training Studies 

Two longitudinal studies of the acquisition of cognitive skills have been completed under 
this contract The first focused on the skill of an otherwise normal subject (DD) who increased 
his digit*span to over 100 digit with 4.5 years of practice usmg the mnemonic ^siem invented by 
Chase ft i.ricsson*s SF. His level of performance is the highest ever observed on this task, 
exceeding fay more than a order of magnitude the performance of nonnal subjects. Because prior 
analyses of DD*s skill had established that his performance was consistent with the priiu:iples of 
skilled memory, investigation focused on analyzing the mechanisms underlying his skill, their 
properties, a^ their interacticm. 



The second study followed two college undergraduates who became mental calculation 
experts by practicing for three to four years with strategies preferred by previously studi^x! 
experts. This stu^ represented an important step in generalizing SMT; its aim was to test now 
well predictions derived fixm SMT could account fot expertise in a task where superior 
infoonaticm retention is not the princq)al goal, but i$ necessary for fast and accurate performance. 
Analyses of fbtk skills have added depth and breadth to our understanding of expertise, its 
cognitive foundations, and, especially, skilled memory and its role m expert perfcmnance. 

Tbt main enq)srical findings of diese studies are summarized following a brief description 
of Ae training procedures used. 

3.L Practice Regimens 

3«LL Digit*Span Trarning Study 

DD*s practice regimen involved practicing the standard forward digit-span task under 
laboratory observation 3 to 5 days per week. He began training with as many as 26 trials per 45 
minute session, however die number of trials per session gradually decreased as iAs span and the 
duraticm oi each trial increased. For more than 60% of his 1079 practice sessions, he received 3 
trials per session. 

The ""i^Mtown** procedure was used to measure DD*s span (Ericsson, Chase, & Faloon, 
1980; Watldns, 1977). Under this procedure, span is measured as the longest list that can be 
recalled on 50% of the trials. Each trial begins with an experimento* gr/ing DD the length of the 
forthcoming list Once DD indicates he is ready (after a brief period ol prq)aration), the list is 
read to him at a rate of one digit per seccmd. Following a interval of silent list n^lmml, he 
begins his serial recall If his serial recall of a given list is correct, one digit is added to 4he length 
of the list presented en his next trial. The next list is reduced by one digit if DD*s serial recaii is 
not perfect 

In most practice sessions, DD provided a verbal report of how he encoded tiie items in each 
list immediately fdlowiiig his serial recall. After the last digit-span trial, each practice session 
ended with a free recall task. Here, DD was asked to recall as much material as he could from the 
previously presented digit lists in any order. All sessions were reccmled on audio ti^. 
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Throughout DD*s training, experimental sessions were frequently inserted in place of 
t^ctice sessions. These sessions were used either to conduct exploratory and more formal 
hypodiesis-testing experiments or collect concurrent verbal protocols. 

3.1^ Mental Calculation Training Study 

Two undergiaduate volunteers, GG and JA, routinely practiced mental multiplication under 
laboratory observation for approximately 1/2 hour per day, 3 to 5 days per week. JA attended 
268 sessions for approximately 175 hours of practice over a 3-year period. Over 4 years GG 
accumulated 618 sessions resulting in about 300 hours of practice. Practice consisted of solving 
multiplication problems using an unconventional, general computational strategy employed by 
die majority of expert calculators whose methods for multiplicaticm have been documented. The 
trainees' practice sessions were frequently augmented or else replaced by sessions in which either 
verbal pifotocols were taken or experiments were conducted. 

In practice sessions two manipulations were used to systematically vary the memory 
demands of the multiplication problems givec. These manfpulations involved independently 
varying problem size and presentation conditions. To vary problem size, die trainees regularly 
practiced on problems whose multipliers were either one- or two-digit numbers and whose 
multiplicands ranged frt>m one to five digits m magnitude. These manipulations produced the 
nine problon-size categories shown in Table 1. All p^-oblems were randomly generated and 
presented in blocks containing one problem from each size category. 

Two modes of problem presentati(m were employed, oral and visual. In the oral condition, 
problems were read to Uie subjects. After receiving a subject's ready signal, an experimenter ^ 
would read first die larger of the operands of the current problem, pause for approximately two 
seconds, and then give the word ''times'* followed by the second operand. The operands of 
visually presented problems were typewritten in the center of 3 x 5 cards, according to die display 
omvention shown in Table 1. The E would simply display tbt printed card face to present a 
problem to the subject in the visual-presentation condition. 

All additional procedural difference distinguished the two presentation conditions. Visually 
presented prcblems remained displayed until die subject either gave an answer or gave up. Thus, 
problem operands were available diroughout the course of computation in die visual condition, 
whereas oral presentation required subjects to maintain bodi operands of a problem in memoiy to 
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solve tte problem successfully. The extra memory load imposed by oral problem piesei^tation 
was exp^ted, therefore, to increase problem difficulty. 

One further manipulatioa was employed as an instructional intervention. It involved only 
ooc of the trainees (GO) and occurred after he had practiced for 500 sessions. Ph>blcm-by- 
problem aiudyses of the procedures used by the trainees showed that the general algorithms they 
used to perform 2x multiplication differed both iri compuUtional complexity and in the memoty 
demands they imposed Process models of th^ trainees' procedures (discussed below) ccmfirmed 
these findings; JA had devised and used a general algorithm for two-place (2x) multiplication that 
was more efficient dum GG's. In an instnicticmal session between UG's SOOdi and 501st practice 
sessions, JA*s strategy was explained to GG and he was told to iqjpty this strategy to all 2x 
problems he received in subsequent practice sessions. This experiment provided an opportunity 
to study the flexibility of a heavily-practiced sidll and test the validity of the model on which this 
intervention was based. 

32. Expert Perfomuince: Some Empirioil Generalizations 

Figure 1 shows the digit-spans t f DD and Chase & Ericsson's SF as a function of practice. 
Notice the spans of both individuals at the beginning of practice; both fall within range expected 
for normal adults of 7 plus-or-minus 2 digits (Miller, 1956). Over the course of practice, DD 
increased his span to 106 digiu. To put his performance in perspective, DD's npan exceeds the 
average norcaal span by qjptoximately 50 standard deviations. DD's span exceeds SF's span of 
84 digits by roughly 25%. There is no evidence that DD's span represents an absolute ceiling on 
his perfwmance. In sessions testing DD*s supraspan serial recall ciqwibilities, he has recalled lists 
as long as 110 digits perfectly. 

DD*s mnemcmic skills have been compared to other mnemonists on another benchmaA 
mcmoiytask. The Luria Matrix (Luria, 1968) is a menK)ry task that 

skills of several exceptional mnemonists (Ericsson & CbMSt, 1982; Hurt A Love, 1972). The 
matrix is a symmetric displ^ of 50 nmdom digits arranged as shown at the top Ona 
t'Mt trial subjects aie given the matrix to snidyfcv a series of se^ These 
trials involve recalling tiflier all of the matrix elements or a spccffied subset of elements in a 
prescribed serial order. The various recall tasks are shown in the lower half of Figure 2. 
Subjects* instructions are to minimize stud time on the matrix without sacrificing acctmcy in the 
test phase. 

iO 
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After over four years of practice on the digit-span task DD was tested on the Lima Matrix 
over 12 consecutive sessi<ms. He received four matrices per session. After study of each matrix, 
upon his signal the matrix was removed and he received six consecutive recall trials of its 
contents. The first and last were always '^entire matrix** trials. The intervening trials contained 
the remaining partial recall trials, their order being counterbalanced across all matrix 
presentations axul unknown to DD fct any given matrix^ 

DD*8 performance in the testing sess' ;us was impressive. His recall performance averaged 
983% correct for his first recaU of the entire matrix across all sessicms. His mean accuncy over 
tail remaining recah tasks (second cohunn down, third column down, third column tq>, zig-zag, 
2nJ entire matrix recall) wa^ 993%. His mean study time along with his mean recall times for 
each CA\^^ recall tasks^ are given in Table 2 along with similar measures of performance taken 
from expert and novice subjects. His siq>eriority on all measures indicate Aat his skill is not 
limited to the digit-span task, and shows some flexibility. In general, the data indicate that DD'r, 
mnemonic capabilities represent die peaks of human memory performance. 

Turning to the results of the mental calculation training smdy, Figure 3 and Table 3, 
req)cctively, show the improvements in the trainees* speed and accuracy for problems differing in 
size and mode of presentation. Figure 4 compares their final solution times as a function of 
{TOblem size and presentation mode against the performance of a mental calculation expert (AB) 
independenUy regarded as one of the world's best^. In shmt, both GG and JA have achieved 
levels of performance that qualify them as experts. 

The achievements of DD, GG, and JA are important in several respects. Fint, they provide 
infctmation about the relative contributira of different variables to the acquisition of expert-lev 1 
cognitive skills. Second, diey show that expertise can be acquired with less than a decade of 
pnctke. Third, they demonstrate that interventions that teach e?q>ert strategies can fSKilitate the 
development of expertise. Fourth, the databases describing their performance and leanung 
represent benclanarks for testing recently*proposed global theories of human cognition. 

Neither DD, GG, or JA began pracdce with any identifiable excq>t!<Mial intellecbial 



^Tlie time li««d for entire nwtrix rccill itfJectt hit fir»« triiJ. Hii tecood recall of ^hc entire mttrix ivengci 10 
•ecoodx fmer. 

^All three Aow comp««bIy low overall error tUtM, 
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abilities, talents, or cognitive capacities. Tbeir achievements, along with other recent studies of 
expert performance (CeCi & Liker, 1985; Schneider, Korkel, & Weicnrt, 1989) indicate that such 
pftdispositioQs, particrJarly high levels of general intelligence, are not essential for acquhing 
expertise. Rather, the changes observed in performance as a function of practice su{^x>rt the 
claim &at practice is die key variable, as Simon & Chase (1973) argued neariy 20 years ago. 
This coQCiusioc does iK>t deny the importance of other variables. Individual differeiices in 
aptitudes and previous experience in related domains may well differentially affect ntes of skill 
acquisition or determine peak performance levels. TTiis is an issue that calls for systematic study* 
Motivation is an in^xHtant variable (Cbamess, 1989); subjects who devote years of practice to 
any demanding task are obviously very highly motivated. 

By the standards of most psychological studied, the dtrration of these training smdies is 
long, however tlx? amount of practice needed by the trainees to bec<mie aq)erts is short by 
standards found in the literature on expertise. For instance. Van Lehn (1989) sutes that the label 
expert is usually reserved frr individuals with several thousand hours cf experience. Hayes 
(1985) has argued that a minimum of a decade of practice is necessary to become a world-cUss 
expend Differences among tasks, subject differences, and differential training conditions caution 
against making broad generalizations about Hat amount of practice needed to become an expert, 
however the results of the digit-rpan and mental calculation training studies reveal that the 
investment cf time, effort, and resources required to achieve erpert-level skills is no^ as great as 
prior studies suggest. 

The training studies show that the processing strategies «nibject practice are important 
determinants of their learning and performance. This holds true whether the strategics are 
discovered by the subjects or explicitly taught to them. In botii the digit-span and mental 
cak:uiati(m studies, DD, GG, and JA were instructed to use strategies known to be used 
effectively by previously studied experts. Additional subjects also c^ginally involved in these 
studies were observed as they practiced under identical conditions. They were instructed to use 
different strategies, however. Whilst all subjects shewed improvementt none using the strategies 
predicted to be less effective progressed much, eitiier in terms of practice or performance, rektive 
to the those given the expert strategies 

For instance, at the luune time DD began practice. Chase & Eri'^sson (1982) also gave 
another subject extended practice on *he r*igit-span task. She was given a strategy that promoted 
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the OKoding of digits as meaningful chunks, but did not lead to the development of a retrieval 
stnicture Her spm improved at a rate c(»nparable to SF's and DD*s, but she hit an asymptote at 
Ig digits, grew frustrated, and quit practicing. 

The mental calculation study (^ginally began with 6 subjects (Staszewsld & Qmt, 1984). 
Using random subject assignment, three were instructed to use the t?iditicnal right-to-left 
procedure, whereas GG» J A, and arodier were assigiMd the left-to^ght strategy. Although! large 
individual dififeiences were evident in subjects' performance, as a groq), subjects m the rig|it-to- 
left conditicm were slower and less accurate than dieir coixM and the marginal advantage for the 
left-io-rigbt group increased as demands relate! to problem size and presentation mou^ increased 
Subjects in the ri^t-to-lcft groq> also exfuessed more frustraiicxi dian than th<»e in ibt left-to- 
right condition* Two of the ri^t-to-left trainees quit before completing 12 sessions and the third 
completed only 22. Al&ough the low number of subjects involved and lack cS coattcH of over 
other subject factors make it difiicult to unambiguously attribute the achievements of DD, GG» 
and JA to the strategies they were given, the pattern of subject pedbrmance, attriticm, and 
strategies obs^^vai in Aese studies suggests they played a very important role. 

The clearest evidence that strategy-based interventiras facilitate the develoraient of 
expertise is die improvement observed in GG's performance following his iuroduction to and 
adoption of JA's 2x computational strategy. A significant feature of tiiis intervention is that it 
was aqq)lied at a relativdy advanced stage in GG'^ traming. Its success demonstrated diat a 
heavily jvacticed perfcmner can learn and beuefi i^vdi new and effective strategies, without 
suffering the heavy negative transfer diat some studies of skill acquisition have found (Shiffiin & 
Schneider, 1977). 

One further accomplishme&i of this project is tiiat it has contributed two large and ridi data 
sets to the empirical database op human e3q)ertise and its acquisition. Studies of human expertise 
have had a tremendous impact on cognitive themy and research over the past twen^ yean, 
however the data base on which our understanding ci expertise rests is disproportionally small. 
The scarcity of e:ipett subjects and the time and resources requnod to thoroughly analyze their 
skills explains this situation. The datasets from these studies add significantly to die txped&t 
database, containing quantiuiuve and qualitiative measures of the trainees* i^tice performance 
(including dbronometric. enor^ and verbal report data) covering thousands of practk^e trials. 
These databases not only ^^escribe human performance at noait of the hi^iest levels ever 
systematically observed, they also describe the learning that led to these levels. 
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The availability of tbese databases is particularly relevant to the emergence of **cognitive 
architectures** m cognitive science. Exemplar include ACT* (Anders(Ki, 1983), SOAR (Newell, 
RosenUoom, & Laird, 1989), ICARUS Q^gely, 1989), and onmecticmist architectures 
(Rumelhart & NfcQelland, 1986). Implemented as simulaticm systems, d^se architectures 
represent general theories of human cogniti(m designed to account lor human petfoimance and 
learning over a wide range of tasks. Their validity as general theories of cognition rests largety 
oa the range of cognitive perfonnance winch diey can accuratety model Because the databases 
built up in this project describe extremes of human learning and perfonnance, modeling its 
findings rqmsents an acid test of currat and future architectures. 

3 J. Cognitive Foundations of Skilled Memory 
Expertise, Knowledge, and LTM 

The fundamental assumption underlying the concept of skilled memory is that 
, 4)rovements in perfonnance as a result of practice are due, at least in part, to subjects* 
increasily efScient use of LTM to encode, maintain, and retrieve information critical for 
successful task performazxe. SMT asserts that experts expand woridng memory cq>acity by 
learning to use circumscribed porticms of LTM to maintain infonnation needed for future 
{TOcessing in a readily accessible state. Organized knowledge acquired duou^ practice in 
specific tasks or dcmiains mediates experts* LTM encoding and retrieval processes. Evidence 
supportiBg tbese asserticms is described in the following secticms. 

Support for the LTM Storage Hypothesis 

Evidence that LTM, rath^ than STM, is die locus of DD*s list storage comes from several 
sources. One is his perfonnance oa the final free recall task tiiat concludes each of his {mctice 
session. He recall for the contents of the lists presented in diese sessiras is in^xessive (Mean « 
93%, SD « 4%) This level of retention is cleariy inconsistent witii dcta on STM forgetting 
(Brown^ 1958; Murdock, 1961; Peterscm & Peterson, 1959) and quite consistent with the skilled 
memory e£fect Enduring knowledge rqmsentations are predicted, if information is encoded in 
LTM. 

Further eviuence for LTM storage comes from an experiment using a variant of tbt Brown- 
Peterson distractor paradigm. In tbis experiment DD was given digit-span trials in which lists 



14 



12 

either 25, 50, or 75 digits^ long were read to him at the rate of one per sec. Unlike usual practice 
procedures which allowed DD to initiate serial recall when he was ready, a distractor task 
involving visual search was interpolated between list {msenution and DD's serial recall. The 
distractor interval lasted either 1,2, or A rnin* Results showed no effect of the distractor upon the 
accuracy of DD's serial recall, regardless of its duration. Ifis aca^'acy averaged 99, 98, and 95 
percent correct for 2S-, 50*, and 75^git lists, respectively. 

Finally, some coaq)elling evidence for DD's storage of lists in LTM comes bom a study 
testing his memory for a 100 digit*list after a 24jiour interval Fbllowing two consecutive days 
without practice (to minimize inutference firom previous days' lists), DD was presented with a 
100-digit list in what he assumed was a nonnal practice session. After his perfect serial recall of 
the list, the nperimenter feigned an equipment problem to end the session. The next day DD 
arrived for practice at the same time as he had the previous day. To his surprise, he ms aske(* for 
a serial recall of the list he had received die previous day. He recalled 99 of the digits in their 
appro{»iate locations. When told his recall was incorrect, he spontaneously identified the 
incom^t digit and its location, and gave the correct (me. He then rqxnted that during retrieval, 
he had narrowed search to two candidates, but not sure which was the correct digit, he guessed. 

Variants of the traditional memmy span testing paradigm have produced results that rule out 
the counterargument that DD does not store information in LTM, but, rather, has somdiow 
expanded STM capacity. If this argument were true, his span for all materials that he encoded in 
STM should be enhanced Howeve^. when either alphabetic symbols or words are read to DD at 
the customary one per sec rate, his span^ for these materials resembles diat of unpracticed 
i^abjects (Crannel & Parrish, 1957). It is also known tiuit STM retention is minimauy affected by 
changes in item presentation rate relative to the large adverse effects that speeding up item 
presentation has on LTM en':oding (Glanzer & Cunitz, 1966; Murdock, 1962). When DD's span 
was tested using digits presented at a rate of four per second instead of the customary rate, his 
span measured at 10, an order^of-magnitude decrease £rom the span exhibited tmder omnal 
practice conditions. These sizeable reductions in his memory span with changes in either 
materials or {msentaticm rates indicate a high degree of specificity to die conditions of practice. 

^VO*t apm at ^be time of Uiis «q>uimeat wu in excea of 100 digits. 

^Ilie up-6om procednre was «lao em|^ed for all experimenU meMning DD*s memory ipan. 
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Ccnsistent with the principles of skilled memory, results fhrni free recall and recognition 
tasks show diat expert mental calctjiators also use LTM to store vital infcxmation during their 
coss^nitatioQS. 

One of the i»ocedures for testing GG*s and JA's LTM retention im similar to ttM used in 
testing DD's final free recall After solving tike fical problem of a pn^tice ^essic^ ^ taiinees 
were asked to recall as many of the problems presented in the sessi(tt as accurately as possible. 
For most sessions, advance notice of die recall task was given pri<»r to the start of calculation 
practice. On a number of occasions, ttie trainees received no advance wariiinj. A problem la^as 
scored as correctly recalleu if the number of digits in bo!h operas N was correct and SC% or more 
of the digits were given in their pn^>er places. 

The trainees* recall perf(»rmance was consistent with the predictions of SMT. Recall testing 
began after GG had accumulated 9? practice sessioris and Ja 136. In the initiai testing sessions, 
GG recalled an average of 31% of the problems presented in the course of a practice sessi<m and 
JA recaUed 37%. With further cakislation practice, JA's recall gradually rose to 46% near the 
end (tf his training and GG*s reached 61%. Consistent with studies of experts* incidental memory 
(Lane and Robotson's (1979); DD's 24*hour delayed recall cited above), the trainees' level of 
recall was unaffected by whedier or not they knew they would be tested for recall on a given day » 

Although their ability to recall {H-oblems supported the LTM coding hypodiesis, it was 
bothers(xne that the amount of material GG and JA could recall was snuU relative to the amotmts 
recalled by SF (Chase & Ericsson, 1982) and DD in post-practice free recall. A plausible 
explanation was that the problems presented to the trainees in the course of a practice session 
(and the memuy representations generated in solving them) interfered w^h their recall of 
information £rom previously presented problems. This account was supported by an analysis that 
showed that the probability of recalling a problem increased as a functicm of its i^esentation order 
within a session for bodi trainees. In li|^t of this int^erence, a recogniticm task was used as a 
more sensitive means of testing die calculators' reliance on LTM storage. Following each of two 
practice sessions, GG, JA, and AB^ were presented with a set of 108 problems aad asked to 
differentiate between diose presented m the immediately preceding practice ^essicm and those 
drawn from a randomly-generated set of distractoi^. Results showed duit the subjects could 
easfly distinguish old problems from new on tte majority of trials (GG 92%, JA 87%, AB 82%). 

^Tbe wodd-duc calculator whom cakuladon ddll served m a tasis for comparing the traiiseef* peifoniiatice. 
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Ccmsistent w the knowledge-based coding hypothesis* evidence for practice-specitic 
mefflc»y coding skills is also shown in expert mental calculation. Although GG solves 2x4 and 
2x5 problems faster and more accurately than AB* his advantage does not extend to larger 
problem sizes. In other words, GG's expertise is limited by his experience; his performance falls 
when he is presented with problems larger lhan those regularly received in practice sessions. 

This specificity is shown by CP's patformance on randomly generated 3x3s and 4x4s, 
problem sizes which AB practiced re^lasly* bi'>t GG had never practiced. Tested on 3x3s, GG*s 
average sohition time for 12 proMems (Mean ^ 4123 stc, %SD = 25.78) was ahnost double that of 
AB (Mean » 21.G9, SD 17.43), while showing conq)arabIe accuracy. Presented with eight 4x4 
problems, GG experienced extreme difticulty and quit computation before reaching an answer cm 
aU but the last two. He correctly solved only the last one. His computation times exceeded five 
minutes (%\ all of the 4 x 4s, and his overt behavior on all but ihe last trial resembled diat of a 
novice stmggling to solve a 2x5. On all but the last two trials, GG cited memory overload as the 
reascm for fa^licre, stating specifically that he '*didn't have the right schemes for finding and 
manip!datl2ig the n^anbers/* Significantiy, after his attempt on the next-to-last 4x4, the first one 
for which he reached an answer, he reported discovering an effective representational scheme. 
His successful solution of the final 4x4 confirms his report. 

In general, the evidence for LTM storage and the specificity of superior retention skills 
observed in these studies supports key assumptions of SMT. These findings, along witti the 
generality of the skilled memory effect, suggest that experts in a wide variety of other domains 
strategically use knowledge to represent and maintain task-relevant information in LTM. 

3.3 JL Mechanisnns of Skilled Memory and their Impkmentations in Expert Skills 

SMI asserts that two types of memory iniechanisms mediate experts* LTM encoding and 
retrieval: semantic memory representations that recognize and encode information as meaningful 
chunks and retrieval structures that inde^. chunks in LTM in a way ttui facilitates their later 
retrieval in the appropriate serial order. It asserts that these mechanisms develq> with practice, 
ami^ once they are established, further practice increases the efficiency with which they are used. 
The following sections review the evidence that skilled memory euppoits the remarkable 
performance of DD, GG, and J A and, in the process, describe how its component mechanisms are 
inq>lemented by these subjects. 
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Exceptional Memory Sidll: Structure and Process 

Substantial progress has been made in developing an information-processing theory of 
DD*s skill. A variety of studies were performed to analyze the encoding operations he employs 
during digit-span triab, the rqmsentations he creates, and the {nocesses he uses to retrieve 
snfofmaticm from LTM. Their findings have led to the identification of knowledge structures that 
mediate DD*s encoding and retrieval operations as well as a fairly detailed description of the 
functicmal organization of diese components of his skill and the nature of their interactioa. 

In theory, the key to DD's excq>tional memory is his ability to quickly create elaborate, 
organized, information-rich LTM codes for the short 3- or 4«digit sequences into which he 
systematically parses digit lists and (b) and to access these rqyresentaticms for orderly retrieval. 
These encoding and retrieval operations are inter-related and mediated by three types of 
knowledge r^tructures in his knowledge base. These mechanisms are identified as (a) his semantic 
coding system, (b) his retrieval structure, and (c) his contextual coding system. 

The contents of Tables 4 and S provide a concrete context for describing these mechanisms 
and their role ic DD*s performance. These tabl^ contain verbatim transcriptions of vecba! 
reforts given by DD after digit-span trials in which he perfect recalled a 7S* and a 5(Migit list 
Fbr sake of exposition, the left-hand cohimn of numbers in each table displays the contents of 
each Ust organized according to their grouping in each report. In these repoits, DD describes how 
he encoded the elements of each list. 

Retrieval Structures 

DD*s reports reflect both how he organizes his encoding and serial recall of digit lists as 
well as the mechanism that imposes a common structure on bodi of these processes. As his 
reports show, DD*s basic unit of encoding is a digit group, either three or four digits in size. 
These protocols reveal regularities in his formation of digit groi^ that generalize far beyond 
these particular lists. 

A salient feature of Tables 4 and 5 is the redundancy in DD*s list parsing. Examination of 
the arrangement of digit groins in these protocols reveals an abstract hierarchical scheme of 
organization* As diese repmts show, DD*s digit groups are arranged in sequences of uniformly- 
sized groups, sometimes four, but mostly three groups hi length. These sequences form higher* 
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order units labeled "supergroups." SvU higher-order units, called "supergroup clusters" pair 
supefgxoiq)s made of 3*digit groups and 4-digit groups. The scheme for organizing these units is 
clearly illustrated in Table 4. This protocol shows that DD organized the first 16 items of this 
7S-digit list as a single 8iq)ergroup composed of four consecutive 4-digit groups. The next 21 
digits represent his first STq)ergroup cluster. It consists of a supergroup of three 3-digit grciq)s 
followed by a siq)ergroup of three 4^git groups. The supergroup cluster pattern is repeaied 
again, and once again, albeit incompletely in this list Table S shows that the same organizational 
scheme is applied to a SO-digit list for as far as its di^ts extend. 

The common structure that these lists share is neither coincidental nor idiosyncratic to these 
particular lists. Analyses of DD's protocols, analyses of his errors on digit*span trials, and 
cbronometric analyses of his list encoding, serial recall, and memory search (Tech. Rept., in 
preparation-a) show tiiat the organization of the lists shown in Tables 4 and 5 reflects a general 
organizing strategy based on an abstract, hierarchical structure. A variety of studies {Resenting 
DD with 50- and 7S-digit lists showed that he unifonnly i^lied the organizational schemes 
instantiated in these protocols. Throughout his training he also ^lied this general scheme to 
lists larger and smaller, truncating or adding abstract organizational units as the length of the 
particular list dictated. The unifcmnity of this scheme across different list lengths can be seen in 
Figure 5, which gr^hically represents his organizing schemes for lists SO, 75, and 100 digits 
long. 

The mechanism underlying DD*s organization of lists is called a retrieval structure. 
Essentially, this mechanism operates as a memoiy mdexiug system. It is used by DD to store 
semantically encoded digit groups in LTM in a way that (a) aiaintains Ae ordinal relations 
between die groups and (b) supports a systematic retrieval strategy. In general, DD's retrieval 
structure anticipates the problem of retrieving a Irjge number of coded digit po^xps from LTM in 
proper order and encodes these items so that pr/ocesses governed by this mechanism can access 
them. 

To make the concept of an indexing system more coiurrete, consider some common 
examples: postal systems diat organize mailing addresses, lib^ systems (e.g., Ae Dewey 
Decimal System) tiiat specify the locations of books in a library, and address systems used to 
locate a particular piece of information in the memory of a digital computer. All of these systems 
define locations widiin an abstract relational scheme where a particular contoit can be 



17 



'^deUveitdt'* stored, and later accessed. The key to their effectiveness is that some '^content'' is 
stored and addressed according to a stable system, so that knowledge of the addressing system 
later mij^rts organized search and efficient retrieval. 

The nviemonic system known die method of loci represents a specific memory indexing 
sysiem ditat has several of the basic properties that characterize DD*s retrieval structure. 
Individuals using this mediod memorize lists of items by first forming an image of each to-be- 
remembered content item then associating each image with i predetermined physical location 
found in a well-known physical environment. Retrievin;^ aie stored items involves mentally 
traversing the path connecting uie locations and "^picking up"* the content associated with each 
location. In theory, recalling each location provides a set of cues for recalling the associated 
items. The key parallel between this system and DD*s is that a familiar, s>*sU;matically organized 
body of knowledge, whether it be of a physical environment or an abstract set of relations, 
coordinates the storage and retrieval of information. 

iTi essence, retrieval structures are mechanisms that organize and coordinate encoding and 
retrieval processes. They combine a well-structured knowledge base with processes that cerate 
npm this knowledge to systematically generate "addresses'* for storing information. These 
addresses** consist of abstract features associated with to-be-remembered content at fbt time of 
storage. At the time of recall, die retrieval structure is again invoked to systematically regenerate 
the ^:ame addresses, which then serve as retrieval cues for accessing die stored information. 
Using the method of loci example, this translates to using a familiar path dirougb a familiar 
region to generate the imaginal locations to which to-be-remembered items are associated Tc 
retrieve die stored items, one systematically regenerates the sequence of locations to access di(dr 
associated content. 

Evidence for these claims about retrieval structures and their role in DD's skill come from 
investigations of his petformai^ on several tasks (Teclt Rq>t. 89-1, in {reparation). For 
example, protocol data like those in Tables 4 and 5 were used to generate models of retrieval 
structure organization and which were then validated usin,; cbronometric analyses of DD*8 serial 
recall. This work showed tiiat DD*s serial recall for lists of varied length was organized in the 
manner predicted by the structural models. Application of these same models to temporal 
patterns obtained from studies of self-paced list encoding showed diat the same structmes were 
used to organize DD*s list encoding. The isomorphic relation between DD*s li8t encoding 
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processes and his retrieval processes for serial recall was al^o shown by substantial correlations 
between the pauses in DD's list encoding and the pauses in his serial recall of individiud digits at 
idoitical list locations. The additional finding tiiat the temporal patterns that characterize bodi list 
encoding and serial recall are hig^y intercorrelated over the overlapping portions of dififerent- 
sized lists supports the claim that a smgle, general retrieval structure is used consist<^tiy. 

The idea that digit groups are addressed in LTM with a unique set of features that identify 
their relative location within DD's retrieval structure h£$ been tested in several ways. In general, 
this view implies that encoded digit groups are relatively independent of one another in memory 
and leads to tiie following prediction: given the location or address of a digit grotq> within a list 
of specified lengtii, DD should be able to both encode and retrieve the digits associated with that 
location flexibly. Two experiments were performed that tested tiiis prediction. 

In tiie first, lists 25 and SO digits long were presented to DD under self-paced ccmditions. 
One digit was presented at a time on a CRT and remained displayed until a response from DD 
displayed the next digit. The digits were not presented in the order in which they were later to be 
recalled, however. Rather, a graphic representation of his retrieval structure was displayed and a 
pointer indicated where a particular sequence of three or tour digits belonged within this 
structure. For each location, the sequence of digits was given in tiie order in which they were to 
be recalled The order in which die locations were presented for study on each trial varied 
randomly. Following Uie "mixed" presentation of digit groups of each list, DD was asked to 
recall die list as he usually would, first witii die initial four groiq>s of fom digits, the next three 
groups of tiuee digits, nnd so forth. Despite ^e novelty of tiiese list display procedures, DD's 
serial recall for 25- and 50-digit lists averaged 95% correct This result is omsistent witii die 
hypoUiesized independence of retrieval structure locations and demonstrates this independ^mce in 
list circoding. 

A memory search experimrat modelled on Sternberg's (1967) memory scaniung paradigm 
demonstrates similar flexibility in retrieving digit groups (Tech. Rept. in prq>arat?on-a). In tius 
study DD received 50H!igit lists read at a rate of rae digit per sec. Fcl'owing list ptesentation, he 
was given a series of cued recall trials. These trials presented «iigit groins from randomly 
selected retil^val structure locaticHis as cues. Dq)ending tqxm the seaiich condition specified prior 
to list presentation, DD's task was to give die entire digit group that eiiber preceded or followed 
the cue in die list. Results showed tiiat DD retrieved die correct digit group on 94.4% of trials. 



ERIC 



21 



19 



This level of accuracy compares w«th the accuracy with which novices retrieve information from 
STM (Sternberg, 1967). Analysis of both response latencies and posttrial protocols showed (a) 
that the pattern of DD's latencies was omsistent with a model oi retrieval that assumes retrieval 
structure mediation and (b) that DD could use the cued information to directly access information 
specifying cue locations. 

To sununaiize, evid^e from a variety of tasks and measures provides siqjport hypothesses 
aboiU the form imd function of retrieval structures. 

Semantic Encoding: A C3iunking System 

Another salient feature of DD*s list encoding that is evident in Tables 4 and S is his 
categorical labelling ox each dig.( group. Essentially , his tqnesentation of each groiq) in terms of 
either a running time, an age, date, cm: miscellaneous pattei:) reflects his imposition of subjective 
meaning iq)on otherwise random information. The advantage of making meaningless information 
meaningful has been known since the time of Ebbingbaus (1964, originaUy published 188S) and 
is well-established expenmentaUy (Crowder, 1976). Not stirprisingly, this general strategy has 
been identiHed as one commonly used by exceptional mcemonists whose memory skills have 
been studied under laboratory conditions (Ericsson, 2935). The implication is ttM these memory 
experts can quickly relate new information to existing knowledge to exploit information in LTM 
as a mnemonic aid 

One of the noteworthy accomplishm^t of this project is the detail in which it has analyzed 
DD's semantic coding processes end their undeiiying knowledge. We have foimd that DD's 
ability to create a meaningful memory representation for any random digit sequence he 
encounters is supported by an elaborate, semantically-rich, hierarchically organized knowledge 
base whictt al'c stq>ports multiple retrieval strategies. 

How can DD's memory representations be characterized in terms of structure and content? 
His protocols reveal that be uses a ^nall set of abstract coding structures to encode digit groins. 
These structures are presented in Table 6. The content the9« structures take is presented in Table 
7, which shows the semantic categories that he uses to give cdierence and meaning to otherwise 
meaningless digit sequences. 
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The protocols in Tables 4 and 5 clearly indicate that DD*s encoding of digit groups in terms 
of their semantic content mvolves more than simply assigning a category label to a digit group. 
Rather, they show diat he frequently assigns a number of meanmgful features to create a weU- 
elaborated representation for a digit group. For instance* he divtinguishes the sequence 420 not 
jizst as a l*milc time but as good high school mile time.** He encodes both 6938 and 5802 as 
lO^e timest but difiexentiates between them by noting that die fcxmer is **a leally slow 10* 
mile" whereas the latter is "a good pace 10-mile." The sequence 142 is represented as a 1/2-mile 
time, "ri^ around the world record.** The sequence 063 is encoded not jus'i as an age, but as an 
age **right around retirement time.** 

The beneficial effects of meaningful elaboration \spon recall are well established (Bobrow & 
Bower, 1969; Craik & Lockhari, 1972; Hyde & Jenkins, 1969; Stein & Bransford, 1979). In 
theoiy, there are several advanuges. First, elaborating a representaticm enhances the pfx>bability 
of recall by increasing the number of potential retrieval cues (Tulving Sc Thomson, 1973) or paths 
(AnderscMi, 1983; Anderson & Reder, 1979) that can be used to retrieve a stored refmsentation. 
When several representations share features tiiat raise the threat of interference, the presence of 
distinguishing features teduces tins Uireat It appears, however, that there is aiK)dier feature of 
DD's semantic encoding that contributes to his superior recall perforaiance. This is the elaborate 
organization of his semantic knowledge base. 

Several sources of evidence suggest that DD*s semantic knowledge base is organized along 
the lines of tiie semantic network pictured in Figure 6. One source of ttippoti for this 
refxesenutional hypothesis comes bom his protocols. The elaborations that frequendy qualify a 
general category label suggest tiiat DD uses a multileveled hierarchy of conceptual categ(mes to 
encode digit groups acccmling to their membership in a set of stable* well-defined classes. 

Supporting evidence c<Hnes from the organization of DD*s recall on the free recall task that 
conchides digit-span practice sessions. Analysis of his recall protocols shows that his recall is 
organized by his semantic encoding categories. The initial digit groi^ be reports are those 
encoded as 1/4-mile times. When he can recall no more 1/4-miIes, he then turns to recalling 
IfZ^lt times, again reporting as many of tiie digits groups coded with this label as be caxL He 
then proceeds to recall 3/4^e times, 1-mile times, 3-kilometer times, {mxreeding through the 
coding categories in die order in which they are listed in Table 7. Analysis of his recall firom a 
10-session sample shows that 94% of all digit groups recalled are clustered within categories. His 
recall of items within these categcxies is ordered according to the magnitude of the coded values. 



21 



Further support for this rqyresentational hypothesis comrs from recently constructed 
sifflulaticm programs developti to model DD's semantic encoding of digit groiq>s and the 
organization exhibited in his final free recall. The encoding model assumes that a senumtic 
network organized alcmg the lines of the network in Figure 6 governs DD*s categorical encoding 
of digit groq>s. Parsing digit lists into uncategorizedt digit groups as DO would with his retrieval 
structure, the program perfcnms a set of tests designed to minor the decisi(m*making procedures 
DD reports using to determine senuntic codes. The results of these tests lead to a category 
assignment for each digit group. G>mparison of the program^s categorization petfonnance widi 
DD*s on identical lists shows *Hat it matches DD*s category assignments on 98% of the digit 
groups it receives. 

ilie free^ccall mode! assumes that retrieval of digit groups is governed by the same 
knowledge reimsentaticm used for encoding. This model assumes a process equivalent fo a 
depth-first t(^to-bottom activation of the nodes of his semantic network and dut activation of 
nodes and their associated labels repitsen*^ activation of cues used to retrieve digit groi^ whose 
representations contain the same semanl features. Tht model then performs a systematic 
withtn«category search as this process has been inferred fiom analyzing concurrent verbal 
protocols of his final free recall (Example: **0K, quarter-miles in 1. 1 forties... four-seven-six... 
quarter*miles in the fifties... five-eight-one, five-nine-oh, five*nine-x!iae**). Evidence for the 
validity of the model is its ability to predict the OTder in which digit groups are recalled by DD in 
free recall. The average rank-ori!^ correlation between the model*s predicticms and DD*s 
performance on a sample of items recalled in 10 practice sessions is .92 (SD ^ .05). 

Furtbi * evidence for the postulated represenution comes fiom Chase and Ericsson*s (1982) 
investigation of the internal structure of DD*8 l^e-time category. They presented DD with 
3-digit sequences that he always coded as l»mile times printed on cards. His task was to examine 
the items and sort them into gioups based Vf/on his perception of their similarity. CSiase and 
Ericsson found that DD (like SF) sotitd these items into a variety of categories that suggested a 
hierarchical knowledge structure containing several levels of mutually-exclusive subcategories. 

I have used a similar 9p;^/oach to replicate the Chase and Ericsson findings for DD*s 1-mile 
category and examme i^ther his other semantk categories were similarly structured. 
Procedures difiered from those of Chase and «3ricsson (1982) in the following respect afier DD 
would sort cards into groups* he was asked to label all grotq>s created and tikn combiM these 



2i 



22 



groups to form larger groups and, again bbel the new set of more inclusive groups. This process 
continued until DD bad produced a single group tfiq)resenting one of the categories listed in Table 
7. 

Using this procedure and the same materials Chase and Ericsson used to analyze DD*s 
iHnile category, results nearly identical to thein were obtained Presented with items diat fall 
into his 2-miIe category, DD*s sorting indicated that this category cxtAitcA a tbnilarly detailed 
hienuchical internal structure* The semantic network pictured in Figure 7 reflects the 
represenution infentd from his sorting of items firom the 2-mile category. The structure revealed 
for diese categcmes suggests a powerful mechanism for diunking random digits into meaningful 
units and also elaborating their represen^tira in a way that both associates sonantically-similar 
chunks with bigfaer-order categcny labels and difTerentiates them at lower levels. 

As Figure 6 suggests, however, DD*s semantic categories differ quite a bit in the amount of 
internal structure they exhibit DD sorts items from his l/44nile and 3-kilometer categories into 
relatively few meaningful subcategories. He reports having rK> subcategories that he cokisisteniiy 
uses to differentiate digit groiq)s categorized as 3-miIe times, dates, or miscellaneous patterns. A 
hypothesis currently being tested is that the degree of structure \;rithin a coctaig category is 
inversely related to the proportion of all possible digit sequences to which that category can be 
MppUcA, The rationale is that the more items that fall into a category or subcategory, the greater 
the ncc^ to encode items with features that discrimiiuue them to ward off interference. This 
raises the issue of bow DD deals with the threat of interference when multiple items fidl within 
his less-differeotiated semantic coding categories or subcategories. The issue is addressed in the 
forthcoming discussion of contextual etK:oding. 

In general, the simulations cf DD*s categorization aiui free recall, his verbal protocols, and 
his performance in the sorting tasks provide converging sxxppon for the theoretical description of 
the organization and content of DD*s semantic knowledge base. With titiis picture of his 
knowledge base, attention turns to how its organization supports DD*s encoding md retrieval of 
list items in the context of difl^t-sp*^ trials. 

How does the kiiKl of knowledge organization shown in these studies contribute to DD*s 
perfornumce? As indicated eariier, well*differentiated coding categories mediate his encoding of 
elaborate, meaningful, atKl relatively unique memory representations for digit groups. The 
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litenture shows that such characteristics proraote retention. But there is z m(m fundamental way 
in which the organization of DD's knowledge base supfK^ such Gocoding. The organization 
shown in DD's semantic memory reduces Ae amount of processing related to his recognition and 
encoding of meaningful patter. Of course, the organizal*^ of his retrieva. structure and the 
manner in which it reliably guides his systematic parsing of lists into codable groims serves a 
similar role and it is evident thr! the operations of Ibese two mechanisms art well integra'^^ 
Using the knowledge represented by these mechanismSt infonnatioa about the sire of a given 
digit groq) and it's first dig^t or two sharply constrains search for the appropriate set of coding 
features to a relativtty small semantic space. 

Although Db's semantic coding system represents a powerful mechanism fot recoding 
digits as meaningful chunks of information, encoding if. not its only fuucd<xi. There is also 
evidence that relates the M^anization cf his semantic knowledge to his serial recall. Thus, like 
his retrieval structure, his semantic coding system mediates both encoding and retrieval of 
mformation in the digit-span iask, although the temporal structure of his serial recall indicates 
that his retrieval structiue is the primary access mechanism. 

Tbe;role his semantic knowledge base plays in serial recall is clearest in situations in whitii 
DD experiences difficulty in retrieving a digit gicup at a particular '^si locati(»i. Most lists in the 
range of 100 digits often mclude such a group or two. They are easily distinguished because their 
retrieval times are measured either in tens of seconds or sometimes minutes. Periods of silence 
this size stand out clearly in DD's typicaUy fast and fluent serial recall. 

Coth concurrent and retrospective protocol data collected on such occasions reveal three 
knowledge^ased strategies fo^ ririeving the missing digits. When DD has a few semantic 
features about a hard-to-rccall group, be restricts his search for its col tents to a relatively small 
range of candidates, probing memory using category labels subordinate to those he holds. 
Together, the retrieval ct^ he hokis and his implicit knowledge of the organization of his 
knowledge base provide the constraints that narrow his search for additional cues. In dtuatioos 
when DD cannot recall the semantk category used to code a group, his protocols show that he 
resorts to using his semantL coding categmes in an orderly generate-and-test strategy for 
retrieval. He searches through his coding categories as he does in his final free recall, naming 
each to himself to see if he recognizes the semantic code for the group in question* If lie can 
establish a category with reasonable confidence, he then uses its internal structure to guide further 
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seaidt In the infirequem instances in which his recovery of a complete semantic code doe«; not 
produce the missing digits, he generates a sequence of digit-groiq)S testing each for recognition 
provided that the number of candidates is small. The interesting feature of these Twck-i^'' 
retrieval strategies is that Oey reveal DD*s ability to intenticmalfy exploit the organization of a 
particular data structure in his semantic memory. 

On the Relations between DD's Knowledge Ccmiponents 

Up to now, description of DD*s retrieval structure and system for semantic coding has 
implicitly emt)hasized tibeir indqpendence. Two points are in order to clarify currectassunqrtioQS 
about their interaction and similarity. 

First, dme are notewordiy structural and functional similarities between DD*s retrieval 
structure and his semantic coding system. The knowledge rqmsentatioas en^loyed by diese 
systems are both multilevel hierarchies with multiple branches at tnOx abstract level. 
Functionally, bodi are used to cvganize and encode to-be-remembered material in LTM and later 
mediate its ordcriy retrieval. Recall how retrieval ?!roctures guidr DD*s serial recall in the digit- 
span task and his semantic knowledge structure guides bo& his firee recall and reconstructive 
retrieval of hard-to-recall digit grot^s in serial recall. The content infoimatioa diat these 
medianisms generate may be different, but their structural and functional similarity is striking. 
Furdier, the similarities between these mechanisms and the mechanism known as a discrimination 
n^ in Feigenbaum and Sim<m*s (1962, 1984; Richman & Simon, 1989) EPAM model of memory 
suggest ttiat DD's semantic coding system and his retrieval structure rqmsent verj .ofdnsticated 
implementations of a general EPAM-Iike memory mechanism ndxpted to handle the demands of 
the digit-span task. 

Seomd, it is important to realize diat the operaticm of these mechanisms has to be extremely 
well coOTdinated, particularly Airing list encoding, considering the complexity of DD*s coding 
processes and tfie presentation rate used in tiie digit-span task. Independent theoretical argur:ients 
about thf^ rea!-time retrieval capabiliiies of discrimination nets (Feigenbaum A Simon, 1984) 
make such precise ten:^)oral meshing plausiUe, lending further credibili^ to current assumptions 
about the interacticm of these knowledge structures in DD's list encoding and retrieval. 
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Coatextual Coding 

AltlKM]^ die evidence reviewed in the preceding sections indicates that DD*s semantic 
codinj{ system and retrieval structure are critical components of his sldll, the operatkm of these 
me ch a nisms alone does not explam all aspects of his petfoonance oa the digit-i^an task and 
related experiments. For instance, protocols taken when DD*s span was in the 80-100 digit range 
(lik. diose in Tables 4 and 5) showed tibat DD's list encoding involves more than isqxxring 
meaningful imeipretations <m systematically parsed list segments. Also, die development of his 
semantic coding ^stem and retrieval structure cannot account for improvements in DD*s digit- 
span that occurred after these mechanissis were well established. After 3 years of practice (when 
his span had not yet surpassed SF's) verbal {votocols taken in regular practice sessions suggested 
diat these mechanisms were intact and operaticmaL Subsequent monitoring of DD*s protocols, 
studies designed to examine the structure and function' of die«e mechanisms, and replications of 
these studies at later points in DD's practice also indicated that his codhig and retrieval ptxicesses, 
as thev relate to these mechanisms, remained quite, stable. This in^lies dut an additiraal 
mechanism or additional mechanisms were developed and/or refined to Uft his span to its peaL 

Similar reascming by Chase and Ericsscm (1981) in their analysis of SF's skill led diem lO 
propose that practice-related speednq) of memory processes was an essential part of the 
development of skilled memory. They argued duit ^fter SF had estatdished h^ semantic coding 
system and retrieval structure, increased efBctency in theL operation as a res^slt of practice 
accounted fot subsequent improvements in SF's digit-span. Gmsistent widi this argument were 
data showing that the time SF needed to encode digit groi^ decreased monotonically with 
practice. 

This finding generalizes to DD. Usmg a self-paced list {msentation procedure to measure 
DD's encodhig speH at yeariy intervals, steacfy decreases in '^is encoding times (ptr digit group) 
were seen over his durd, fourth, and fifth yearr of practke. But anodier important and togicaUy- 
refaied devetopment acconqMuiied improvements in his coding efficiency. 

Over this period, the emergence of a new coding mechanism was revealed in both the 
temporal diaracter of DD's serial recall and his verbal reports. Widi increasing frequency, pairs 
ci digit groups and tripletj dut composed supergroiq) units were recalled with unusually short 
intergroup interval'' la his retrospective protocols, DD regularly rqnmed coding more than just 
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the meaning and location of these sequences. Almost invariably, he rq)orted explicitly codhig 
lelations UMmg tbt symbolic elements he held in memory. Together, these two related 
phernxnena suggested that increased encoding efiRciency had qxiwned a new mechanism for 
encoding higher-order patterns of infcmnatica. 

These patterns, and the labels DD uses to rqpresent ttiem, belong to ttie broad class of 
encodings called contextual codes. Contextual codmg refers to DD*8 rqxesentatioa of a wide 
varied of lelations that he discovers as he receives a list on a digit-span What differentiates 
his citatioa of contextual codes from his encoding of retrieval structure locatioos and leouuitic 
encodingofdigitgroapsisdteniegularandvariabtenatureofc^ WhereasDD 
invariably encodes the relative iocaticn and semantic content of each digit group in a list, the 
contextual codes he creates, if any, dqpend XKpM contextual variaUes such as (be contents of a 
particular trial list, tte coaiesM of any jmceding lists, and his repitMnUtion of list contents. It is 
hnportant to enq^ize toe diat tiie content information available to DD during list presentati(m 
is not equivalent to diat available to a novice, due to DD*8 knowledge-based pattern recognition 
capabilities. 

What do contextual codes consist of and wh&t accounts for their creation? Beneath the 
sup^ficial diversity that characterizes DD*s ccAtextual codmg frcxn trial to trial, orderly ""deep 
stnictures- exisl. Analysis of DD*s retrospective verbal reports from over 100 digit-span trials 
has revealed several abstract categories of contextual codes. Typically^ these categories are 
identified in his protocols by distinctive verbal labels. Tables 4 and 5 contain a few such as 
"back-to*back,- "add-em up," "faster ftan," and \ ottier Isemantic code]." 

fin these protocols DD rqports coding relations between semantic codes whose creation can 
be sq^arated by eithez a little or a lot of time and processing activity. Such relations link (a) 
semantic codes aeated for digit groq>s presented contiguously within the same digit*span trial, 
(b) codes hi the same list whose creation is sqmrated bodi by time and DD's codhig of 
intervening digit groups, and c) codes from different lists presented hi the same sesrion. More 
concretely, in Table 4, DD recognizes ttut the third and foivm digit gro:^ 
6*minnte range. He retotes these items by coding die second sequence as a faster time than the 
first b a shnibu fashion, he reports that two contiguous sequences (6938 and 3802) occunring 
later in the same list weie bodi encoded as 10 mile times and differentiated on the basis of thek 
asrignmenttosid)categorieswithhithelO*milecat^ory. Likewise, at several pohits hi Table 4, 
DD tepom having noticed duu codes for contiguous digit groi^ have redundant elements. 
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Redundancy is not the only basis for relating contiguous codes, however. DD regularly 
recognizes and encodes sequences of semantic codes that occur in ascending (e g* *lia!f-mile, 
three-quarter, one*niile**) or descending (e.g., "three^^ rterHnile, half, quarter**) order within his 
semantic coding system. He also notices and encodes alternating sequences codes 5uch as 
**age, mile, age,** or **two*mile, ten-mile, two-mile.** Interestingly, his coding of such relaticms 
between tr^ts of semantic codes is restricted to simations in which redundant (e.g*, **nile, mile, 
mile**), ascending, descending, or alternating codes occur within a 8iq»ergroiq> unit of his retrieval 
structure. This constraii % suggests tbat abstract locaticm information and semantic codes serve as 
building material for contextual codes* 

Contiguity is not essential for DD to notice semantically similar codes and relate diem. In 
Table 4 that he reports noticing that 9390 is nearly identical to the sequence 9393 diat occurred in 
a previous trial. The scope of such discoven^ relations sometimes extends across sessions. On 
one occasicm, DD noticed that a particular digit sequence he was encoding had occurred in an 
identical position in a list m the previous day's session. A check of the lists presented on the 
previous day verified DD's observation. This anecdote, and, in general, DD's ability to discover 
redundancies <tf the type described here reflects two salient characteristics of his skill. The first is 
a remarkable retention of inf(»mation over intervals of considerable length, during which a 
wealth of potentially irierfering information is encoded. The second is his ability to recognize 
the threat of mterfe^^e that redundant coding of digit grotq)s creates and to encode relational 
information that links and differentiates the redundant codes simultaneously. The result is a 
unique memory code which, in theory, resists interference. 

DD's encoding of different relational patterns of information is not restricted in content to 
semantic codes. He also rqports noticing a variety of relations among digits. For example. 
Tables 4 and S show several instances in which he notices the repetitim of contiguous digits, both 
witinn and between digit gmsp boundaries. The phrase **back-to-back** typically identifies 
contiguous and redundant symbols, be they semantic codes or digits. He also frequentiy ttpofrts 
coding synunetric relatiras between mdtvidual digits and pairs of digits which he codes witii the 
label **frontwards/backwards** (cf . DD's coding of the sequoice 8558 hi TaUe 4). His use of this 
latter category tebel for a variety of different contents (e.g., 191, 8558, age-mile-age) iUustiates 
the abstract nature of his ccmtextual codmg patterns. 

His protocols show that arithmetic relations among digits and digit groiq>s form the basis for 
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another class of contextual codes. There are frequent occasions on which DD rqK>rts coding 
pairs of digits within 4-digit groupi in terms of the difference between the quantities represented 
by each pair. For example, in Table 4, DD reports coding Ae groups 4346 and 4S48 in terms of 
an attribute denoting a subtractive relation (e.g., '*apait'*) and the value **three." Additive relations 
aie also noted for several digit groups and identified by the label **add-*em-up.** This label is 
applied when scmie subset of digits within a group sum to eider another digit or combination of 
digits within tibat group. The protocols related to the digit groups 352, 642, 716, and 021 in 
Table 4 reveal several instances in which this goseral relation is encoded In the case of the 
contiguous sequences 352 and 642, DD explicitly mentions noticmg a double redundancy in his 
coding of these items; fiie pair are coded as being both **back*to-back** 1-mile times and 
••add-^em-ups." 

Evidence suggests that DD uses contextual coding to reduce memory interference on the 
long trials he received at advanced stages of practice. The longer lists DD gets as a result of 
inq)roving his digit-span increase the amount of potentially interfering infonnaticm wifii which be 
must deal. This idea is consistent with the view that interference is file principal threat to recall of 
information stored in LTM (Anderson, 1985; Ciowder, 1976). Becaure DD's creation of 
contextual codes is a iorm of elaborative encoding, this activity should enhance retention, 
provided it iocs not hinder other **regular** ceding operations. 

Several sources of evidence show th^t interference is a very real threat to success on 
extended digit-span trials. Chase and Ericsson (1982) have shown for both SF and DD that 
accuracy of serial recaD diminishes as a function of trial order widiin a practice session and that 
list rehearsal time^ increases. In addition, the probability of correctly recalling a digit group in 
postsession free recall increases as a function of trial order. Subsequent work wifii DD has 
replicated these findings, although the magnitude of tlie effects has diminished with practice, 
when list length is held constant. Further evidence for interference has come from studies using 
error analysis and protocol analysis. These studies relate serial recall errors to confusion of new 
informatioii with information encoded earlier within a list or in previous trials. Regularities in 
these errors suggest duit they are due to confusion of specific types of mformation created by 
DD*8 difiTerent coding mechanisms. For example, errors involving the transposition of entire 



^RehMinJ time it defined as the time between prestation of the last digit in a trial lift and the point at which DD 
begins hit serial racalL 
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digit groups from different sv^pexpoap clusters implies ccnifusion of retrieval structure locations 
within a list. Other errm car be attributed to the semantic smuUmity of incorrectly recalled digit 
groups to previously jn^sented digit groups. 

Experimental evidence supports the idea that contextual coding enhances DD*8 list retention 
by reducing interference between As memory codes during aerial recall. In a series of 
experimental sessiras DD was presented with SO-digit lists whose contents were not randomly 
generated, but cc efully constructed to manipulate die oppommities for contextual coding they 
presented. ^lEnricled** lists provided many potential opportunities for contextual coding, whereas 
'^depleted'* lists were designed to minimize die number of such opp<»tunities. It it fanportant to 
note that manipulation did not in any way alter DD's normal list parsing, semantic encoding of 
digit-groups, or r^eval structure indexing. 

Table 8 presents measures of DD's serial recall perfonnance as a function of list type. 
DD's serial recall was near^)erfect and did not differ as a function of condition. Consistent with 
expectations, indices of retrieval speed show that tiie contents of enriched lists were recalled 
much more quickly that those of the depleted lists. Analysis of recall times as a function of list 
type and trial order within a session siqjpOTted the hypothesis that contextual coding reduces LTM 
iaterference. On the first trial of each session, where interference should be minimal, recall times 
were neariy equivalent for both list types. Recall times rose sharply for depleted lists on 
subsequent trials within a session, where increased interference is expected. Recall times for 
enriched lists did not increase iq)preciably until tiie fifth and sixth trials, and then only modestly 
relative to increases observed for depleted lists. 

This worl: provides converging evidence for DD's encoding of contextual relations and 
suggests ^t this activity pUys an important, but not indispensible role in enabling him to 
achieve neariy perfect seiial recall of n^idly presented SCMiigit lisu. It seems likely dut 
contextual coding phiys & much more important role in achievmg perfect serial recall as the 
length dT trial lists increases. Consistent with the view of Chase and Ericsson (1982) and in 
siqpport of die theofy that motivated diis snidy, contextual coding appears to be a mechanism 
emfdoyed to reduce die interference that remains, even after well-elaborated memory traces have 
been credited using mechanisms :hat semantkally encode short random sequences and code their 
Ofdrud relations. 
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An iiQpoftant Aeoretical point is that DD's contextual coding rqmsents an emergent 
mechanism used to achieve exceptional memory performance. This mechanism can be 
understood in terms of die development of skilled memory. The coding strategies and structures 
related to cmtextual coding are based on information created by DD's use of a retrieval structure 
and a similarly organized body of semantic knowledge. Practice with these mechanisms 
increased the efiBciency of their operation. The c(xisequence is that large amounts of task- 
relevant infonnation are encoded, accessed, and recalled both reliably and efficiently. This 
efficiency is imponuix in that it provides DD with the resources to strategically process 
mfoimation available to him in a woridng memory whose cipacity is expanded by efficient JTM 
coding and retrieval mechanisms. 

Some General Conclusi<»s About DD*s Skill 

CoUectively, studies of DD's skill mppon the hypothesis that DD creates richly elaborated 
LTM rq^resentaticms for the materials presented oa digit-span trials. Various sources of evidence 
described in this npon suggest that the composition of these rq)resentati(ms is consistent with the 
f.bstract tripartite strucnire originally proposed by Chase and Ericsson (1982) and shown 
schematically in Rgure 8. In addition, these studies show that the same mechanisms play 
inqxMtant roles in retrieving stored information. 

Recently, developers of computer models of human learning and skilled performance have 
discovered the advantages of combining uifcrmation from different knowledge modules within a 
system to produce intelligent behavior (Newell, 1989, 1990). Consistent with these findings, this 
analysis shows that DD's skill can be decomposed mto separate knowledge coo^ponents whose 
coordinated interaction produces extraordinarily high levels of skilled performance. 

To sum up, DD's exceptiraal memory skill results bom the intenction of practice, 
knowledge, and strategies. At the start of his training DD exhibited no unusual general aptimdes 
or menmy abilities. He was selected, however, for his familiarity with a particuhr body of 
knowledge that he couM bring to his training and given an effective strategy for inlying that 
knowledge to extend his digit-^an. 

Throu^ practice DD developed new resources for handling die memory demands of longer 
and longer lists. Practice widi the strategies be was given (and ttie memory mechanisms he 
devek)ped to implement tiiese strategies) improved his performance in the digit-span task until his 
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mogt^ brought him to the limitations of these mechanisms* Their ase, however, provided him 
^!t^ the resources to fesbion and implement new st^itegies for dealing with the demanls that 
K^jiger lists posed. His exploitation of the resources available to him - which include a large and 
veiy well organized knowledge base, a wide reoeitoiie of encoding strategies, and enhanced 
infoimaticm procesr ng efficiency - enabled hun to increase his infonnation^frocessing capacity 
and extend the Umits of human perfcnmance on the digit-span task. 

I emphasize what I see as the most novel and important contributi(»i this research makes to 
understanding human expertise. In the literature, there is nearly universal agreement that a 
coroentone of expeitise is experts* ability to rapidly encode global patterns of familiar, task* 
relevant information (Chase, 1986; dase a jd Simon, 1973a, 1973b; Chase A Ericsson, 1982; 
Ericsson & StaszewsB, 1989; Glaser & Chi, 1988; Newell, 1990; Olson & Rcuter, 1987; Posner, 
1988; Tech. Rept 88-1, 894, 89-2). With the excepti(Mis of studies carried out in die context of 
SMT, few studies address how these patterns or chunks are represented, retrieved, and used 

Studies of DD*s skill go further. They catalogue the variations m content^ structure, and 
complexity of the patterns an exp2rt strategically creates to achiev^ exceptional performance. 
They also dissect the knowledge base that supports high-level pattern recognition capabilities, 
identifying specific mechanisms used to create one level of pattern information and whose 
efficient and interactive operations generate tibe material and resources for the creation of hi^ier- 
order pattern. They also show the same mechanisms are used to retrieve the encoded patterns. 
In describing how practice, knowledge, and strategies relate to expert pextonnance, these studies 
describe how specific component mechanisms of DD*s skill internet with each other and with 
practice to support the development of new and ad^tive knowledge structures and memory 
coding processes. 

Expert Mental Calcniatioa: Structure fMd Process 

Several sources of evidence show that the mental calculation trainees* memory management 
strategies are consistent with the princ^les of skilled memory. The three following sections 
outline the mxppcfti for this claim. Detailed analyses of the . ^ees' computational procedures 
also show that efficient computational strategies can reduce the memory demands of mental 
cafculation and that their adoption can improve performance. 
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Evidence for Skilled Memory 

Retrieval stnictares in expert mental calculation 

One source of evidence for retrieval structures are concurrent verbal protocols collected 
fixm GG and JA. A|>proximately 25-30 hours of verbakxcd solutions have been collected bom 
each. These data were obtained by asking the trainees to '^Ihink aloud** (Ericsson & Simon* 1980; 
Newell & Simon, 1972) as they solved problems just like those presented in practice sessions in 
terms of size and mode of presenuticm. 

Clear evidence for retrieval structures comes bom the manner in which both trainees 
structure problem multiplicands* intermediate results* and products in their protocols. For 
instance* when GG encodes a five-digit multq)licand iqxm presenutioo* the pame between his 
enunciation of the second and diird digif. is noticeably longer that those aepariOing other 
consecutive pair of digits (whkh are approximately equivalent). The same temporal pattern 
characterizes his subsequent referraces to this operand wi&in a solution protocol. Itie findhig 
that this pattern is ccssiaiently observed for the vast majority of five-digit multiplicands in the 
lxS*s and 2x5's indicates that this is an abstract represenutiooal format* Tbe pattern of pauses 
sugi^ests that this format is hierarchically strucnued* consis Jng of two intermediate level abstract 
units* the first containing the first two digits and the second containing the re m a ining three. The 
temporal stnjscture of his four-digit multiplk:ands suggests a abstract hierarchical itructute 
containing two groups of two digits. Similar regularities are evident for his structiffiug of 
intermediate results and final products* aldunigh it appears that the organizational foraiat used to 
represent any number that Mf^eais in his compulations depenAa txpoa variables such as its 
magnitude* the size of the problem* and the function it serves in ccmiputaticm (problem operand* 
faitermediate result* or fmal product). JA's protocols reveal siniilar structural reguhrities in his 
representation of numbers* however the specific formats used by JA differ from GG*s. 

Chronometric dau have been used to confirm these observations. Because listening to the 
muttipUcands recalled by the trainees in post-practice problem recall sessions suggested the same 
sort of tenqx>ral organization seen in their concurrent protocols* the final bet recall protocols 
provided an opportunity to test notions of retrieval structure organization based on the concurrent 
protocols. Therefore* predictions were nuule about Ae strucnire of multiplicands for different 
sized problems for each of flie r mecs and then tested by measuring the pauses between the 
individual digits of the multiplicands recalled in fmal free recall. 
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For example, die tree-stnictures drawn in tfae righMumd corner of each of the plots in Figure 
9 represent die retrieval strocture organizations predicted for GG's 3*, 4*, and 5-digit 
multiplicands. Note that predicting a single structure for all probleois with t common 
multiplicand size, disregarding die variables of mult^lier size (Ix or 2x), problem presentation 
mode (oral or visual), and multq)lier value, implies that a conmxm abstract structure is used to 
code many varied instances. 

Making die same assimiplion used test models of DD*s retrieval structure organization (Le., 
that a tree traversal process is used to access digits represented by the terminal nodes of a 
hierarchical associative memory structure), longer pau. are predicted between pairs of digits 
that span retrieval strucnire unit boundaries. The data is plotted as « function intervals 
between consecutive pairs cf digits and problem presentation mode, represent means for a 
minimum of 50 observaticms. The important finding is that statistically reliable increases in 
pause times occur at die predicted inter-digit intervals, confirming that retrieval structures are 
used to store and access task-critical inforaiation in LTM. 

Fuither evidence validating the trainee^s use <tf retrieval structures comes iSrom die 
successful prediction of the trainees* solution times achieved by process models discussed hi a 
buer section of diis report. A more detailed description of die retrieval structures employed by 
the trainees is found in Technical Report 88 2. 

Evidrace for chnnldng 

A characteristic of expert me^ital calculates repeatedly cited in the literature is an extensive 
knowledge base of interrelated and easily accessible number facts (Ball 1892; Bidder, 1356; 
Bryan, Undley, & Harter, 1941; Hunter, 1962, 1977; Jakobsson, 1944; Mi^hell, 1907; MueUer, 
1911; Sandor, 1932; Smith, 1983). For example. Hunter (1977) reported duu Aidcen could 
''automatically'* report wbedier asqr number iq) to 1500 is a prime or not an^ if not, immediately 
give its factors. Bxyan, Undley, & Harter (1941) itponeA di&t anodier expeft^ AO, knew 1)y 
heart" t^mult^licationtaUes^ to 130 X 130,diesquaresof all numbers tq) to 130, die cubes of 
numben iq> to 100, fomth powers iq[> to 20, and more. Consistent widi diese reports, AB exhibits 
a similariy elaborate knowledge base <tf number facts and rektions, which he ttpons to have 
d^vekped not duou^ mtentional memcMization but, natumlly, through years <^ practice at 
squaring and multiplying. 
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In spite of the limited amount of practice GG and JA have had relative to the expeits cited 
above, several kinds of evidence indicate that each has acquired a stofs of declarative knowledge 
which resembles the knowledge acquired by lifelong experts in use, if not in volume. 

Tbc first signs of such development occurred for both trainees at a remarkably similar point 
in dieir training. Around their 200th session, both showed an increasing incidence of unusually 
fast solution times on ceitain 1 x 2s. The times for these problems were closer to ttioae normally 
observed for idenC.ty problems (whose multiplier is 1) and '*decade'* problems (whose 
multqriicands are multq)les of 10) than to the times for problems whanc products presumably had 
to be computed rather than retrieved from memory. When questkmed about these instances, both 
OG ar^d JA invariably reported recognizing a familiar pix)blem and consequently deviating Irom 
their usual solution procedures. In many of these instances, ttiey rq)orted immediately 
*1mowing** an answer upon receipt of a problem. In additicm, botib tepotttd occasi(»ially iK)tk:ing 
such funiliar problems embedded m larger problems and altering ttieir cooofnitatiooal plans 
accon?'ngIy. On the basis of these reports, post*trial retrospective protocds were taken from GG 
and JA during practke sessions on a regular basis to detemune the frequency and circumstances 
under whkh such events occurred. In addition, the frequency with wt^ch concurrent protocols 
were taken wu increased. 

The retrospective and C(Hicurrent protocol data both showed evidence consistent with the 
mnemonic encoding principle of skilled memory. The retrospective protocols showed that the 
frequency with which the tnunees* repoit txHicing fEuniliar subproblems with jHactice problems 
increased furly steadily with practice. Between sessions 226-235, both rqxnted nothing such 
subproblems oa about 30% of the practice trials on which such pattern recognition can occur. 
These percentages stood at 89% for GG after 450 session and 64% for JA at the end of his 
practke. 

The concurrent protocols show these numeric patterns are treated in two qualitatively 
differeni ways in the course of computation. First, these patterns are expressed are quantities 
rather than as concalenjUioDS of single digits. For histance, with the problem 25x4, the 
multiplicand would be expressed as '*twenty-five,** rather than as a ordered pair of digits (i.e., 
''two, five*). Second, such familiar patterns are distinguished by the ways their products are 
produced. Typically, there is no record of intermediate computation intervening between 
attention being paid to a familiar sut^oblems operands and the generation of its product, which 
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occm almost instatitly. This is consistent with the veiy fast solution times observed for sxicb 1x2 
problems in <:tice. 

In what sense are these panems meaningful? The answer is best revealed by the trainees* 
protocols. CoDcunent protocols show that theik^ recognition of particular patterns leads to the 
selection of one of a variety of fiuriy local compoutional strategic These 
stra^gies are local in two senses. First, die patterns on mliic^th^ are based are smaU and cons^ 
ci two elements, a multiplier und mult4>ltcand OG*s patterns, even at the conclusion ot Us 
nafaiing, were rarely larger don 1 X 2. Tikis was generally true for JA, aldioi^ occasionally be 
rqxxted encoding patterns as large as 1x4. ll)eliiniled8izeokdiesepatteim,irtiichfiUb widiin 
estimates of STM capacity, suggests the constraint duit STM imposes on codisg processes. 
Second, these strategies are local in die senjrc dut tb^ are implemented widiin the larger 
stereotypic control strucnires diat represent die genend left-teHrighi algoriduns ttud JA and GO 
use for one- (Ix) and tw^vjdace (2x) multiplicadcm. Essentially, die patterns that 00 and JA 
recognize and encode rq>resent familiar subproblems which can be solved efiHcienUy widi 
specialized strategies. 

Bodi the patterns and strategies to which particular patterns relate diflfer for die trainees. In 
general, JA has a larger repertoire of strategies and a greater variety of pattern classes to which 
diey are related Like 00, JA has ""expanded his muttiplication Ubles** so dut diexe are a wkty 
of 1 X 2s whose products he can retrieve and tepon in a second or less. His ability to Uendfy 
quickly die factors of particular numbers enables him to combine fiacioring and retrieval as a 
means of solving certain problems and subproblems. In addidon, in die course of practice JA 
discovered an abstract pattern of results related to an abstract class of problems dut led him to 
deviK a computadonal strategy whose back: procedures resemble tboie taught in die 
'nacKUnberg system of speeded martiematkal computation (CuUer ft McShane, 1960). 
Basically, diis strategy is a rule-based computational system i^licable to problems or 
subproblems whose multq>lier is 9 and whose mult^licand is a sequence of digits that are ddier 
identical (e.g., 9 X 444) or else ascend or descend in units of ddier 1 or 2 (^^^ 
X 234, etc.)- Es^loidng die redundancies in. die products of such problems, JAs strategy enables 
hhn to eliminate addition operations from his conqmtadons , dius saving hhn time. 

Aldxwgh 00*s strategies are less varied and original dun JA*s, his strategies; illustrate how 
npid pattern recognition and eCGcient strategy use can improve calculation ^leeJ. TaUe 9 
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iAiesents the principal results of a study that investigated the relation between hii^ strategy use and 
pcrfonnancc. 

In this study the efiiciency of CQ's strategies was examined by testing him on the entire 
population of 1 x 2 multiplication proUems (excludirg problems dut use 0 as a multiplier). In 
the oftl condition, each of a set of 810 problems was read jiust as in oral practice blocks. In the 
visual condition, problems were presented visually on a CRT. P)robIems in each condition were 
preaented in random order, and aj^xoximately 100 tials were presented in each of 8 sessions 
conducted on cou^ecutive days. On each trial 00*s task was to report the product of the 
presented prt^lem as quickly as possible and afterward report the strategy he used to solve the 
problem. 

Four hnsk strategies previously observed in GG*s protocols were rq)orted. His Identity 
strategy, applied to problems with a multiplier of one, is intuitively obvious; GG would singly 
report die multiplicand that had bt^n presented. He described his second strategy, labeUed 
R^neval, as one in which he would repoit the product that he '*unmediately knew** t^pon proUem 
presentation. CHj*s other two 1 x 2 strategies involved sequential arithmetic operations, in 
contrast to the two aheady mentioned. The strategy kbelled Cafculation invoWed solving 1x2$ 
in die way Chat GG originally solved them at the beginning of bis trainings just as novices would, 
using two operations to generate simple products and a third q>fntion to add them. His 
remaining strategy, labeUed Grouping , rq)resents an abbreviated version of full computation. 
According to GG, this procedure involves two consciously controUed stq>s. The fint o ^don, 
he reports, is his immediate and simultaneous r^eval of two simple products upon receipt of a 
problem. The second operation is their addition. Note duit widi die excepdon of problems on 
which die Ttdr computation strategy is used^ GG*s concturent verbal protocols indk:ate that be 
represents multq)licands as single quantities or chunks nuher dun as disa ^ symbols. This 
suggests that die patterns driving stmtegy selection aie specific pairs of quantities. 

TaUe 9 shews die proportion of trials on which each strategy was onployed for both 
presentation conditions, and GG*s mean reaction times aggregated as a function of r^xxted 
sotation strategy. Because detaUed presentation of diese results exceeds the scope of die present 
discussion, die data win be used to make difee general poinu. First, die '^fiill'* strategy is used in 
only about S% of die situaticms where GG used it as a novice. Second, die "retrieval" and 
"grouping" strategies prodvct solutioos much more quickly than full calculation, li is also the 



ERIC 



30 



37 



case that these strategies produce fewer intermediate results^ an important consideratioo h coat 
sudi reiolts rcpn$eax potential sources of interference when it is time to retrieve a product for 
output llurd, GG*s average response latency (the interval between problem presentation and 
reqxnse initiation) for all visoally presented 1 x 2s (Mean » 692 msec, SD « 136) approximates 
sohuion times that unpcacticed adults produce in solving \isually presented tfani^e (1 x 1) 
multipiication problems (Aiken & Williams, 1973; Campbell, ^987). In gcaeral, these findings 
iOuitrate bow knowledge ncquired through practice can produce dramatic improvements in 
calculation speed The knowledge referred to here consists of meaningful patterns of information 
associated with specialized computatiooal strategies. These patterns meaningful in the sense 
tm they are explicitly encoded and used to enable JA and OG to achieve their principle goal, to 
solve multiplication problems as quickly and as accurately a& possible. 

The general poim here \ti dut AB, GG, and JA all exhibit a form of mnemonic coding that 
resembles the pattern recognition aq>abilities of experts ftom other dooudns (Chase Sc Simon, 
1973a, 1973b; Eisenstadt & Kareev, 1975; Reitman, 1975). Throu^ extensive practice widi a 
wide variety of problems, these experts have learned to recognize multi-digit patterns that 
nmdomly occur both in isolation atxl ^bedded ui larger problems. ITiese patterns are 
meaningful in the scns^ that they are linked to specific computational strategies that reduce 
cafculation times. In their discussion of the role of pattern recognitic;^ in the play of 
chessmasters, Simon & Chase (1973; Chase & Simon, 1973b) suggested that the patterns experts 
bold in memory are linked to plausible '^good" moves. As a result, their ability to rapidly 
recognize familiar patterns enables them to select moves more efi&^^ently than less skilled 
players. The current work shows that similar IaK>wledge*based pattern recognition r^i^Mbilities 
enable expert mental calculators to employ computaticmal algorithms that decrease soluticm times. 
Thus, this work ex{^citly links complex pattern recogmtion with strategy selection: and high*level 
performance. 

It is theoretKally significant diat Siegicr*i; studies of children's arithmetic (Siegler & 
Jenkins, 1989; Siegler ft Shrager, 1984) suow that children*s iddlls parallel those GG and JA in 
several respects. For instance, children discover and emptoy a vari^ of con^mtational strategies 
for solving simple arithmetic problems. In addition^ there is good evidence that their strategy 
selectioo is apparently determined by their recogiiition of spectfk familiar cotnuinations of 
problem o\ rands. Finally, as their skills inwove with practice, memory retrieval replaces multi- 
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stq> computatioD as the preferred sohiticm strategy for an increasing number of {roblems. These 
paralleb suggest a fundamental ccmtinuity in the stall acquisition process across age levels, 
imctice levels, and tasks. 

To sum up, it appears that expert mental calculatCHS use semantic memory in three principal 
ways m achieve fast and accurate performance. First, consistent with Skilled Memory Tbeory^s 
mnemonic eacoding priix^ple, they use an elaborately interrdate£ krmwledge base to recognize 
ami encode meaningful patterns of numbers ttutt occur either as problems or embedded 
subproblems, dius prcxnoting their retention. Second, much like chessmasters apparently ust 
dieir pattern recognition o^abilities to ef^ciently select effective chess moves* calculatioQ 
experts use their unique pattern recognition capabilities to select efiBdent computaticmal 
strategies on a problem-by*fffoUem basis. Finally, experts use tibeir knowledge to regiBCt 
computation with retrieval as a means of generating products and intermediate results, thereby 
decreasing solution times. Just as SF and DD became expert mnem(»iists by learning to use 
semantic mesuoiy to encocte meaningful patterns of digits, this work show ttuU GG and JA have 
devek)ped knowledge bases which ^y use in a similar fashion to beome experts in the domain 
of mental calculation* 

Evidence for speed-up 

The speed-vp seen in the trainees' solmioi times implies an underlying increase in the 
speed with which meaningful patterns are recognized, eiKoded with retrieval stinctures, and later 
retrieved. These data do not provide conclusive suppcxt for SMT s speedmp prtncq>Ie, however, 
hnimvements in sohition speed can result also £rom the discovo^* and use of efficient 
computational algorithms that decrease botti the processing and mtmory denumds related to 
computing solutions. Clearer sui^>ort for die speed-up pdndpit comes firom data that relate 
speed-up more directly to memory encoding and retrieval processes. 

Figure 10 plots GG*s learning curves crwbining curves for orally and visually presnted 
problems of corresponding problem size in each plot. The shaded area hi each pkM dqricts the 
visual advantage, that is, the amount of time by which mean soiudon time for oralty presented 
problems exceeds die mean for visually presented problems. This measure is obtained by 
subtracting the mean sohiticm time for visually presrated problems bom the mean for orally 
presented problems for each block of 5 practice sessicms. Unshaded areas between the two 
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fimcticms indicate blocks in which problems presented orally were solved nu>re qiuckly than 
visual problems. 

The most salient ccmmion feature of these plots is the reduction in shaded area as a function 
of practice. This reflects a gradual omvergence of soluticm tunes for die oral and visual 
I^esentation. Statistical cmnparison of (ml and visual soluticm times from sessions 476-500 
reveak that a reliable difference between means occurs only for 1 x Is; here, (Mrally presented 
problems are solved mote quickly, in additicm, a ccmsistent but ncm-significant oral advantage is 
obseived for 1 x 2s and 1 x 3s. Considering GG's ability to use direct r^eval to solve a good 
propoctioo of 1 X 2 problems, presentation of the first problem operand in the oral condition may 
prime associated patterns in semantic masuxy and lead to faster retrieval times than diose found 
ill the visual conditi(»L For all other problems sizes, die means still reflect a slight visual 
advantage that is swanq)ed by the variability in soluticxi times. The main point, however, is that 
the visual advantage evident in the eariy stages of practke diminishes in all cases with i»actice. 
Tbe interpretation of this trend is that GG' s skills at storing and retrieving proUem infcnmation in 
LTM have in^>roved widi practice to a point where be encodes and (grates \sposi internal 
reptesentatiOTS neariy as quickly as he processes external representations. 

The same general pattern, that of a reduction of tbe visual advantage as a function of 
practke for al. ^ oblem sizes larger than lx3*s, is also observed for TA. Consistent with this 
fvactice related trend, further eviaeoce for the speed-up princ^le i? seen in AB's performance. 
His solution times show no reliable differences as a function of presentation mode over the same 
range of problem sizes. 

The relevant difference between pr^entation conditions lies in the demands they place upon 
memory. Recall that, in the visual condition, problenr'q>erands are const? Jy available for 
inspection while tbe solution process proceeds, whereas in the oral condition geci^g the coi.^t 
answer to a problem depends on perfect retenticm <tf problem q>erands. Thus, it seems 
reasonable to assume diat the visual advantage stems from the extra tLne used to encode and 
retrieve problem qperands (and tb^ir constinient elements) in die oral condition* 

The k^ empirical finding is that die visual advantage evident in die early stages of practice 
and logically related to memory toad diminishes wit! practice. The interpretation of this trend is 
duu the trainees' skills in storing and retrieving problem information in LTM have improved widi 
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practice to a point where they oicode and operate upon internal representations nearly as quickly 
as they process external representaticms of the same informatioa This development is consistent 
with the speed-up principle of skilled memory. 

Procer Strategies, Capacities, and Performance 

Althou^ ttit evidence obtained indicates that skilled memory rej^esents <me way experts 
can increase information-procesi^ing efficiency, it is not ti:e only means. Automatizing 
IHOce'^sing qmtions rqiresents another (Schneider, Dumais, & Shifirin, 1984) is another. The 
use of efficient strategies is still another (dase & Ericsson, 1982; deng, 198S; Hunter, 1977; 
Simc»i, 1975; Sieger & Jenkins, 1989). Several findings from the mental calculation training 
study support this latter claim. 

First, the tnuning stu(fy (viginally omtrasted the effect of practicing tik Idt-to^right and 
conventional right-to-left methods of multiplicaticm on the assumptim that ligfatTimg mental 
caknilators use the most :^r«vient strategies (Chase & Ericsson, 1982; Hunter, 1977). The pattern 
of earty practice results obtained was consistent with this a8Sunq>tion. 

Secrad, analyses of ccmcurrent verbal rqxMts taken fiom JA and GG as they solved 
problems revealed a significant difference in the way in which they implemented left-tc-right 
strategy. Although their procedures are more similar dian different, sharing many crnnmoc 
features mcluding the use of chunkiug aod retrieval strucnires, the general strategies they applied 
on 2x problems cleariy differed in terms of the number of processing steps involved JA*s 
strategy was clearly more efficient ttsan GG*s, particulary in the number of operations that were 
devoted to maiitaining intermediate results in memory. 

Third, model-based analyses also showed differences in processing comfdexity between 
JA's and GG*s 2x strategies. Id fact, it was the protocol evid^e t^at prompted co::^stn]Ction of 
process-models of the trainees* procedures. Thepurpose was to represent ttieirthou^t processes 
in way that the relative efficiency of their ccmiputational strategies could be mear Jti objectively 
and precisely. 

Briefly, these models were designed with control structures that produced the sequerice of 
operations that characterized each tnunee*s solutioii procedures across different problems of 
varying size. A theoretically important feanire of the models was their assumption that both OG 
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mi lA used retrie\^ structures to encode and retrieve problem operands ana intennediate results 
m worlcing memcHy during their calculations. Because die protocols indicated that most of their 
arithmetic q)erations involved only elemr of operands and intermediate results, both models 
assumed that (a) all multiplication and addition operations are carried out on pairs of symbols 
lepreaenting single digits'^* and (b) retrieving individual symbds, nested within hierarchically 
organized knowledge rquresentations, carries with it die overiiead of traveling die abstract 
aichilecture used to organize and access diese symbols. Ihus, basic memory search and retrieval 
operatkms that access the isqputs for arithmetic operations ccnstitute a major porticm of the 
processing needed to execmeeadi simple arithmetic opentirainvoWedt^ A 
measure o{ task complexity related to solving a particular proUem is obtained by summing the 
number of elementary memory operations needed to execute the aridmietic, rehearsal, or 
reformatting operations that a solution algorithm dictates for a specifk [noblem. 

Once die trainees' procedures were captured in running i»ograms, the diffek^nt programs 
were given a test set of identical 2x problems. Comparison of the models' performance showed 
that JA*s method was more efficient dum GG*s in terms of the number of operaticms it required to 
produce scdutions. A more detailed examination of the models* performance as a function of 
[m)blem size showed Uiat die relative advantage of JA*s strategy varied proportionally widi 
problm size; JA's strategy was only marginally more efficient dian GG*s in sohdng 2 x 2s, but 
its margin of superiority increased monotoiucally with each inaease in multq)licand size. 

These findings were ccmsistenr with the pattern of results shown repeatedly when hVs and 
GG's solution times were compared at equivalent levels of practice using samples taken bt^e^^ 
sessions 100 and 300. GG*s ave^ige solution times were faster than JIA*s for all levels of Ix 
problems, roughly equivalent to JA*s times for 2 x 2s, and slower by an inaeasing murgin as 
pxblm size increased to 2 x Ss. 

To test the validity of diese models more direcUy, the same problems diat Utt trainees 
received in practice were fed into the simuk'^oa programs. Vx€ix runs produced estimates of task 
complexity for each problem of each problem-sL category. Ihese measures were used to 



^Thcw fisft-apjpiDxanadon moddt make no ptoviiionf for either trainee's ability to retrieve eolutiont to fcniliar 1 x 
2t er encode aul operate upon their double-digit openmda at unitary quantitief. Tbeiefoie. de^te die encoomging 
leaultf obtained using tfieae modds as analytic tods, dn^y hct fuitbft development and testing before they can be 
coofidsntfy regafded as complete, psydiologicaliy valid models of r}ie tnunees* sldHs. 
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predict the trainees*, solution times. The results weie both encouraging and infamative. First, 
across several samples both GG*8 and JA*s models could ccmsistently account for about 80% of 
the variance in dieir r. ,fcc6ve sohition times. The p£ rametcr estimates produced by these 
analyses inq>lied that JA took roughly 300 msec per assumed processing operation and 00 about 
250 msec, values correq)onding to independent estinutes of die durations of goal-directed 
cognitive operations (Newell, 1990; Simon, 1979). 

From this findmg, it follows that GG*s sohiticm times should be faster than JA*s on 
problems whose solutions required approximately the same number of openitio!!^^ Moreover, 
because the models show that GO*s sohitions for 2 x 3s, 2 x 4s, and 2 x Ss require increasingly 
more operations uj^m JA*s sohitions, the models predict that his section times should futfaer 
and farther behind JA*s as the size of 2x*s increases. These predictions fitted the pattern of 
results shown rq^eatedly when JA*s and GG*s sohition times were compared Thus, the model- 
based analyses offered an e?q)ianation for the differences obser;ed in trainees' performance. 
G0*8 relative advanuge ov^r JA in processing speed is reflected in his faster solution times on 
lx*s, but diis advantage is negated on the larger 2x problems by the additional processmg 
operations that his 2x algorithm requires. 

To directly test the hypothesis that the differences in the efficiency of JA*s and OG*s 2x 
compu^tional algorithms might account for this pattern of performance, the following 
experiment was perfc^med At die beginning of OG*s SOlst practice session, JA*s method for 2x 
calculation was described to him and he was instructed to use this new method in all subsequent 
pnctice sessions. In order to compare GG*s performance using die two strategies in as controlled 
a fiishion as possible, the problems prec^nted in sessions 401-300 were re-presrated in sessions 
501-600 in the same order and under their origiiial presentation conditions. 

Figure 11 plots GO*s average solution timts for sessions 476-300, sessiras in which he was 
still using his original 2x algorithm, and sessions 376-600, sessicms in which he was fairiy well 
practiced in using JA*s 2x algorithm. Comparison of the functions shows an improvement in 
sohition times for all problems sizes with practice. Ihe average improvement for Ix problms ib 
i^iproximately 7% and r^n-esents a baseline against which improvements due to the experimental 
manipulation can be measured Focusing on the times for 2x*s, the effects related to switching 
strategies are interesting in se\^eral respects. 
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First, the general pattern of im|MX>vement was predicted in advance by die simulation 
models' estimates of task ccHnplexity; solution times for 2 x 2s, 2 x 3s, 2 x 4s, and 2 x 5s showed 
decreases of 9%, 25%, 31%, and 40%, respectively. Note that the greatest decreases occur for the 
problem sizes on which GG's soluticm times are filter dian AB's (2 x 3's, 2 x 4'8, and 2 x 5's). 
The impUcaticm is that GG, left to his own devices, would have required considerably more 
practice to achieve die level of performance diat instnictioaal intervention has produced. This 
result demonstrates how isfcmnation obtained by analyzing expert performance can be used to 
'*engineu** human expertise in cognitive skills as well as perceptual skills (Biederman & Shifi^, 
1987). 

Second, GG was able to adapt to the new algoridun with surpnsingly little difficult^;. 
Quantitatively, close inspection of GG's 2x learning curves showed a relatively sma)! and 
temp^xary increase in GG's solution times immediately after switehing strategies. A sciall (2%) 
and temporary increase in his 2x error rates also occurred at the same point. Qualitatively, 
concun^t verbal protocols taken during die first few days of die strategy switch also showed a 
corrtspondinf decreiiSc in die fluency of GG's sequence operations. In retrospective rqpcms, 
he mentioned having to pay **a litUe closer attention** to sequencing his operations on each 2x 
trial, which he believed slowed him down. Significantly, bodi types of protocols revealled tiiat 
GG could encode problem operands and intermediate results via his retrieval stnictures and 
execute his pattern-driven computational strategies widiin die new algorithm witliout any 
i^ypaitnt difiiculty. 

Theories of skill acquisition (Fitts t Posner, 1967; Schneider & Shiffrin, 1977; ShifBrin*: 
Schneider, 1977) tiiat emfriuisize die development of automaticity predict diat high levels of 
practice under stable conditions produce relatively inflexible skills. While there is some evidence 
for die negative transfer predicted by such dieories m die current experiment, die performance 
decrement related to GG's switch from a fanuliar strategy to a novel one is trivial compared to 
diat shown when the task en^^dronment in yfMch subjecu had automatized dieir skills was altered 
radically (Bq^eriment 1, ShifiGrin & Schneider, 1977). The relative absence of negative transfer is 
not entirely surprising, because the hi^ variability built into die trainees' practice environment is 
not conducive to die development of aut(»naticity (Schneider, Dumais, ft Shififrin, 1984). 

Aldiough it 8cems likely dmt automatization of relatively low-level processes (pattern 
recognition, memory retrieval) makes an important ccmtribution to 'jO's impressive performance, 

48 



44 



the ease with which he adapted to ushg a new algorithm indicates a considerable degree of 
control and flexibiiity in his skill. This finding supports Bartlett's (1932) views on expertise and 
flexibility. This experiment also shows Oiat die acijuisition of expert-level skill is a complex 
process involving more than the automatization of mental operations (C3ieng, 1985) and the 
devetopment of skilled memory. Strategy discovery and use play inqi^vtant roles. 

4. Methodology: Implications for Theory 

The approach this project uses to analyze expertise and its develofsnent departs from the 
traditknal methods used by experimental psychologists and computer scientists in several 
ieq)ects. distinguishing features are described followed by a brief discussion of its advantage 
and disadvantages. 

Fnrst, it analyzes ccnnplex, goal-directed behavior that stretches often over several ifns of 
seconds* The activities studied are complex in the sense that tb^ require selection and 
coordination, particularly serial (vganization, of a variety of cognitive processes. The intent is to 
catalogue as completely as possible the key structures and processes that co:atribute to exceptional 
bunum performance as well as the coatr<A structures that ordiestrate their operation. 

Analysis of subjects' learning and performance is fme-grained Individuals rather than 
groups are studied, and their p^ormance is analyzed on a tria!-tyy*trial basis as they adi^t their 
activities to the demands of a pi>rticular task environment over long periods of practice. The 
danger of blindly averaging over subjects and trials is that subtle regularities in behavior that 
represent precise and flexible adaptation can be hidden. 

The analyses are comprehensive. Once again, die vxtent is to catalogue as ccmipletety as 
possible the key mechanisms that contribute to exceptional human performance and the contr<4 
structure(s) that govern their interaction. Therefore, multiple tasks are used to study subjects' 
performance (i.e., variants of the digit-span task, letter*q»an, wcHd-^Nm, the Luria Matrix, free 
recall, probed recall, etc.) often widi multq)le methods (verbal protocol analysis, experimental 
hypodiesis testing, simulatimi), using multiple measures (chronometric measures, accuracy, 
verbal reports). Such a strategy is designed to identify specialized subsystems mediating complex 
bdiaviosr and to test die range of their application. Also, keeping in mind that the phenom^ 
under investigation, knowledge and cognitive [Mrocesses, can only be studied indirectly, the use of 
multiple methods is extremely imp(Miant to incure construct validity. This multitask, 



45 



multimethod a{^roach assumes that the strategy of converging operations is the best route to the 
soundest scientific ccmclusions. 

The obvious disadvantage of the idiographic iq)proach is the difficulty of producuig findings 
that generalize across subjects. However, in spite of the relativefy few subjects studied under the 
rubric of ttie Skilled Memory Proj^t (by Chase, Ericsson, and Staszewski), rommon 
characteristics consistent with die tenents of SMT have been abstracted firoir dUiffercL. expert 
subjects from different skill dcmiaics. Of course, comprehensive analysis of even a single 
subject, let alone several, usmg multiple methods and measures is inherently expensive in terms 
of time and resources. 

Hopefully, the foregoing pages show the reader that the advantages of this i^praoch to 
studying expertise outweighs the costs. In general, rhe advantage of this qy(mach is that it can 
yield relatively detailed, coherent locai theories of expertise and its development from which 
more general theoretical principles can be abstracted In contrast to much of the previous 
research on expertiKe, this approach foltows the methodological advice Newell (1973) urged upon 
cognitive psychology for the sake of sustained theoretical ptogress. It aims for a anninebensive 
understanding of the structures, processes, knowledge that experts employ and the way in which 
these elements are organized as integrated, adaptive informaticm-inrocessing systems that produce 
goal-directed behavior. Newell (1990) cites th^ theoretical value of this i^iproach to 
understanding the complex information-processing that characterizes intelligent thought 

The longitudinal character of this approach is important because it reveals how intelligent 
systems adapt to the demands of particular tasks with experience. This apfmsK^h has shown how 
practiceHrelated changes in knowledge representations and {mcesses produce quantitative and 
qualitative changes in behavior. Whereas otiier qq>roaches to the study of expertise have led to 
the inference that acquired knowledge is its foundations, tiiis approach has succeeded m 
demonstrating the validity of tiiis claim* Siegler and Jenkins (1989) note how this approach to tiie 
study of leammg is particuhriy useful for understanding the relation between cognitive strategies 
and performan^, important but relatively neglected topic. Most importantiy, from the 
perq)ective of diis project*s objectives, Uiis approach has produced new empirical and theoretical 
insights into how ncmnal individuals overcome innate impediments to learning and perfcmning 
complex cognitive tasks to achieve levels of performance once considered beyond their 
capabilities. 
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5* Practical Implications 

Because this project falls into the categoi y of basic research, its main goal was to [ffoduce a 
body of theory and dau that would add to our scientifc undentandmg human expertise and its 
development Fundamentally, it has shown dut expert knowledge can be analyzed at a relatively 
fine grain and has demonstrated die value of such analyses for modelling expeit perfoimance and 
designing interventions that facilitate the developmmt of expertise. These accomplishments hold 
practical implications for the enterprise of Imowledge ecgineering*** 

In the context in which it was introduced (Feigenbaum, 1977), the phase ^knowledge 
engineering** referred to die development of expert systems by computer scientists studying 
artificial intelligence. Researchers and practicioners ge^ierally agree Chat extracting knowledge 
from experts so tiiat it can be rqrresenied in functional programs is peAq>s die most cnicial and 
difBcuIt aspect fo building sucL^ systems (01s<m & Renter, 1987; Waterman, 1986; Winston, 
1984). The direct knowledge extraction methods typkally used and taught account for dtis 
problem to no small degree. They are only suitt4 to identifying knowledge dut an expert can 
consciously access and communicate accurately. Kfuch psydiologicai evidence on the flaws of 
self-rqKm measures (Nisbett & Wilson, 1977) and the nature and penetrability of expert 
knowledge indicates that inherent limitations hamper conventional methods. 

This project*s successful modeling of aspects of its experts* performance suggests that its 
comprdiensive approach to knowledge extraction, combining a complmentary variety of direct 
and indirect mediods, offers a viable and potentially valuable alternative to die conventional 
approach, ^n addition, the tiieoretical accomplishments of this project offer knowledge engineers 
a body of knowledge tiiat can be used to guide dseir efforts in knowledge extraction. To die 
exteni( that diis project offers a body of theoretical and tucihodological princij^es diat computer 
scientists can apply, it can help make knowledge engineering less an art (Feigenbaum, 1977) and 
more die scientific activity diat the term engineering implies. 

The findings of diis project*s training studies also suggest that Icnowledge engineering** 
extends beyond die field of artificial intelligence to that of instructional design. They show tiiat 
cognitive science has die methods to discover and expUcitiy describe expert processbg strategies 
and duu interventions based on such research can be used to facilitate die development of 
expertise. Its description of knowledge structures su^iporting expertise suggests that organized 
semantic networics and retrieval stnKtures can be used by educators as targets for histruction and 
learning (Glaser, 1989; Glaser & Bassok, 1990). 
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6. Summary of Publicaiions 

Staszewsld, J. J. (1988). Skilled memocy in expert cnentnl calculatioiL In NL T. H. Chi, 
It OUser, & M. J. Farr (Ed), The nature qf expertise. Hillsdale, NJ: Eilbaum. 

Staszewski, J. J. (1990). Excq>tioQal memory: llie influence of practice and knowledge on the 
development of elabarative encoding strategiei . In W. Scbbeider & R E Weinert (Eds.)* 
Interactions among aptitudes, strategies, and knowledge in cognitive peTformance. New 
York: Springer-Verlag. , 

Ericsson, KL A., & Staszewski, J. J. (1989). Skilled memory and expertise: Mechanisms of 
exceptional performance. In D. Klafar & K. Kotovdcy (Eds.), Complex iitformation 
processing: The impact qf Herbert A. Simon. Hillsdale, NJ: Eribaun* 

Staszewski, J. J. (in preparation). Retrieval structures and exa^Hioril memcr;^ A chronometric 
an&lysis. 

Staszewski, J. J. (in {reparation). Semar>iic memory structures unJ. lying excq)tional memory 
perfonnance. 
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table 1 



Problem Size Cate§ertes 
Cate gory fe^mple 
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1 X 1 X 6 

37 

1 X 2. X 4 

895 

1 X 3 X 9 

1,472 

1 X 4 X 8 

91,41:3 

1x5 X 7 

— 73 

2 X 2 X 38 

856 

2 X 3 X 52 

4,957 

2 X 4 X 76 

31,265 

2 X 5 X 69 
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0 0 10 

0 5 10 

0 0 15 

0 0 5 



ProMsmSIzs 
1x4 1x5 2x2 

10 15 25 
0 25 20 

15 25 15 
e 0 0 



2x3 2x4 2xS 

55 70 90 

35 30 60 

40 41 75 

34 15 35 



FinaJ Thirty Sssslons 



ProblsmSIzt 



Prt— ntatlon 



Or^ 0 5 0 4 5 7 13 15 15 

QQ 

Visual 0 0 2 4 7 6 10 13 13 

Oral I 3 4 ^ 11 9 13 30 29 

Visual 0 54 56 5 19 16 
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Table 4 

Verbal Protocol: DO's Encoding of a 75-digit List 



DD's Report 

OK,ltst9P)u;)wasahalfmil«tOh,two,oh,four. I said oh. two. oh, four, 
half mil*. . 

And th«n, ah, I had bPck-to4)ackfour^ thtfs forty-nina ti9«aniy-sevan, a tan 
Ritta. and I aaid that two and aavan add up to that nina and had that foity- 
nina. 

Than, ah, flva, dght, thr**, two was a tan mfla and I juat said I got back-to- 
back tan rnOaa and than thathiM and two add up.to that Iva. IsaklOK, 
flva's tha fist digit and thaaa add up to it 

And. urn, than tha aightaan hundrsd i just said was a data. 

Andthan,um,sav«n,no,thfca,fiva,twowasamiiatima. IsaMifsarsai 
test miia tima and rs an add-'anHjp. 

And sfac four, two was « ffliia tima. WM an add-'anHip and I Just said. OK, 
tha/ra both add*'anHjpa. but th^raltatotaiiydifl^fant imaanonaisao 
much fastar than tha otharona, but thay wara both baek-to-back milaa, 
add*'anHJpa. 

And than Nni> two alg'<it was a two mihk 

Than forty-sb( flfty-aight wu a tan mila. and I juut said that was tweiva apait 
batwaan tha forty-six and Wy-aighL 

And th*.n forty-aavan Ifty-thraa was a tan miia and Q was six spart and I 
said. OK. I got baek-to-back tan miias. 

And thaf^ ah, four, thraa, four, six was a mila tima, thfso ap9!t batwaan 
forty^hnK and forty six. 

Than ah, savtn, ona, six was a thrsa thousand matar add-'aiTHjp. 
Than ah, two aight four was an aga. 
i had back-lo-back fours, H was just four, four, four, was amiie ttma. 
Than nina, oh, two, Iva was a two miia 

andninat^'<^nin•ohw•sahvomi)•. I sak!, OK, 3 had nina, thraa, nine, 
thraa bafom On a praviousiy prasantad 1st], this is just nina , thraa, nina. oh. 
That was back-to-back two miies. 

'rhanthathiidonawasatwomiia,soi|^thrBatwomi)asinarowharB. ft 
wu aH)ht fiv«.IN«.aight and ft was fror. ./anls and backwards. 

Than. ah. two, aight ffva was an aga. i sakj OK, f had, i ji^ had two. aight, 
four. Thisistwo,aightilva.onatanthofayaaroidar. 

Than, ah. taven, sbc two was an aga. idkin'treally do anything with that 

Andthonaight.six,ninawasanag8. i said OK, it's almost eighty-seven, t 
just saM OK, i got two back-fo-back«^es that ireaiiywasnl crazy about I 
just wanted to rshear&d and gat trxk to them as fast as possible. 

And then the four, thrae, nina. three was a nHla. I saM flfly apart. 

And then fcur, five. four, eight was a mile and I saki t^iere was three apart 
between those. 
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Table 5 



Verbal Protocd: OD's Encoding of a 50-digit List 



Group DD's Report 

3785 Rr»t group, uiTJ,th«whcte«rtt four groups of feur, It iL««t\««mtwage^ 

milo. miit. two agM. and tho miltt \Mre simiiar and th« two agM 
similar, ao I Just wa:^ tat on tttat imaani was In gn«t shape. Soitwas 
twn agas, thay wars two apart, first group. 

6307 Thanthamiia.JustaRttlaovaraixandahalfminutasisaid. 

6261 And ttianaxtona, six, two, six. ona. I said, OK, ITS faster and It's sbcty-two 

sb(ty-ona, and It's one apart iMtwaen the sixty-two and sixtyona. 

8871 And tfien the last one was eigtrty^ht. seventy-one. ITS two ages and! 

Just, I didnt figurs any age dHferenee, Ixjt i knew that the first age was in 
the eighties and my first group was ages in the eighties, so i was OK with all 

ofthat 

420 Then.ah,iour*twentywasamlle. 1 Just said four twenty fiat, that was easy 

enough, a good school mile. 
799 Seven, nine, nine was an age. i said it was almost eighty years old. 

810 And eight ten was a two mile, i just said ifs a rsaiiy fast two mile. 

6938 Then sixty-nine, thirty-eight was a ten mile, and i Just said It was up there. It 

was Ute a rsaliy slow ten mile. 
5802 And then, rh, fifty-eight oh two was another ten miie and I said, OK, ifs 

almost Ifty-eight minutes, ifs, Ifs a good pace ten mile. 

3798 Then uh, thirty-seven ninety-eight was a 1 0K. H wasn't a legitimate 10K, 

txjt i Just rememi>er saying OK, if s almost thirty-eight minutes, if you tNni( 

about it l«s that 

063 And then ah«., oh, six, three was an age. i said It was. ah, 8ka fight around 

retirement age. 

142 And one forty-two was a half mile, i said It right around world record half 

miie. 

886 And then eight, eight six was an age. i didn't really do much with that 

because then all of a sudden i had badt-to-back sixes, so i inked those two 
up. 

6933 and It was sixty-nine, thirty-^ree, another ten mile. 
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Table 6 



DD's Semantic Coding Structures 



Coding Structure Exampie 

Three<Hgit groups 

Time 3:52 

Time + Deumal 56.4 

Age + Dedmal 79.9 

"0" + Time 049 

"0" + Age 063 

Misc Pattern 111 



Four-digit groups 

Time 49:27 

Time + Decimal 9:02.5 

Age + Age 8785 

"O'^ + Three-digit code 02:04 

Date 1955 

Misc Pattern 9876 

Misc Pattern + Decimal 963.2 



Table 7 



Category 



1/4m 


497 


1/2nn 


142 


3/4m 


315 


1m 


420 


3k 


716 


2m 


S2d 


3m 


iHo 


10k 


■ 2£i4 


10m 


4753 


Date 


1800 


Age 


284 


Misc. 


987 
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Tables 



DD's Sfrial Recall Performanct as a Function olUst Typa 



UttTypt 



Enrichad ilisfii 



M.8 M.8 

80 08 4.4 

t 1.095^ 

Hi^isifsrt Tiffin 

% jm 29.1 48.9 

80 ' 13.8 19.4 



2&0 48.0 

t 4.09r' 

• 

RsCTlTlin# 

mm 43.3 66.7 

sr. 27.5 51.5 

r/Mk.t) 94J{ 54.S 

i 1.972* 

Total RtcaBTlfW 



72.4 115.6 
80 37.4 63.1 



57.5 101J 
t 2.877- 



Wof. Alt«rtion»^aiM.df«46. *e<.05;**e<.01;***£<.001. TSntMivporttdinMCondK. 
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Table 9 



QQ*t Strattglts for 1 X 2s 



Condition 



oral 



Mean RT 

Proportion of Trtato 



Monaty 

230 
11.1 



RtWval Qwaalnfl Cricutotten 
303 430 830 

»^ 88.0 8.1 



Visual 



MmrRT 

Propoftion of Trials 



527 
11.1 



831 
24.3 



728 
81.0 



1014 
3.8 



No(t: TlmMgivoninmNtaoondt. 
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